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Preface 


This text is based on a course I have taught for many years to first year graduate and 
senior-level undergraduate students at Caltech. One outcome of this teaching has been the 
realization that although students typically decide to study plasma physics as a means to- 
wards some larger goal, they often conclude that this study has an attraction and charm 
of its own; in a sense the journey becomes as enjoyable as the destination. This conclu- 
sion is shared by me and I feel that a delightful aspect of plasma physics is the frequent 
transferability of ideas between extremely different applications so, for example, a concept 
developed in the context of astrophysics might suddenly become relevant to fusion research 
or vice versa. 

Applications of plasma physics are many and varied. Examples include controlled fu- 
sion research, ionospheric physics, magnetospheric physics, solar physics, astrophysics, 
plasma propulsion, semiconductor processing, and metals processing. Because plasma 
physics is rich in both concepts and regimes, it has also often served as an incubator for 
new ideas in applied mathematics. In recent years there has been an increased dialog re- 
garding plasma physics among the various disciplines listed above and it is my hope that 
this text will help to promote this trend. 

The prerequisites for this text are a reasonable familiarity with Maxwell’s equa- 
tions, classical mechanics, vector algebra, vector calculus, differential equations, and com- 
plex variables — i.e., the contents of a typical undergraduate physics or engineering cur- 
riculum. Experience has shown that because of the many different applications for plasma 
physics, students studying plasma physics have a diversity of preparation and not all are 
proficient in all prerequisites. Brief derivations of many basic concepts are included to ac- 
commodate this range of preparation; these derivations are intended to assist those students 
who may have had little or no exposure to the concept in question and to refresh the mem- 
ory of other students. For example, rather than just invoke Hamilton-Lagrange methods or 
Laplace transforms, there is a quick derivation and then a considerable discussion showing 
how these concepts relate to plasma physics issues. These additional explanations make 
the book more self-contained and also provide a close contact with first principles. 

The order of presentation and level of rigor have been chosen to establish a firm 
foundation and yet avoid unnecessary mathematical formalism or abstraction. In particular, 
the various fluid equations are derived from first principles rather than simply invoked and 
the consequences of the Hamiltonian nature of particle motion are emphasized early on 
and shown to lead to the powerful concepts of symmetry-induced constraint and adiabatic 
invariance. Symmetry turns out to be an essential feature of magnetohydrodynamic plasma 
confinement and adiabatic invariance turns out to be not only essential for understanding 
many types of particle motion, but also vital to many aspects of wave behavior. 

The mathematical derivations have been presented with intermediate steps shown 
in as much detail as is reasonably possible. This occasionally leads to daunting-looking 
expressions, but it is my belief that it is preferable to see all the details rather than have 
them glossed over and then justified by an “it can be shown" statement. 
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The book is organized as follows: Chapters 1-3 lay out the foundation of the subject. 
Chapter 1 provides a brief introduction and overview of applications, discusses the logical 
framework of plasma physics, and begins the presentation by discussing Debye shielding 
and then showing that plasmas are quasi-neutral and nearly collisionless. Chapter 2 intro- 
duces phase-space concepts and derives the Vlasov equation and then, by taking moments 
of the Vlasov equation, derives the two-fluid and magnetohydrodynamic systems of equa- 
tions. Chapter 2 also introduces the dichotomy between adiabatic and isothermal behavior 
which is a fundamental and recurrent theme in plasma physics. Chapter 3 considers plas- 
mas from the point of view of the behavior of a single particle and develops both exact 
and approximate descriptions for particle motion. In particular, Chapter 3 includes a de- 
tailed discussion of the concept of adiabatic invariance with the aim of demonstrating that 
this important concept is a fundamental property of all nearly periodic Hamiltonian sys- 
tems and so does not have to be explained anew each time it is encountered in a different 
situation. Chapter 3 also includes a discussion of particle motion in fixed frequency oscil- 
latory fields; this discussion provides a foundation for later analysis of cold plasma waves 
and wave-particle energy transfer in warm plasma waves. 

Chapters 4-8 discuss plasma waves; these are not only important in many practical sit- 
uations, but also provide an excellent way for developing insight about plasma dynamics. 
Chapter 4 shows how linear wave dispersion relations can be deduced from systems of par- 
tial differential equations characterizing a physical system and then presents derivations for 
the elementary plasma waves, namely Langmuir waves, electromagnetic plasma waves, ion 
acoustic waves, and Alfvén waves. The beginning of Chapter 5 shows that when a plasma 
contains groups of particles streaming at different velocities, free energy exists which can 
drive an instability; the remainder of Chapter 5 then presents Landau damping and instabil- 
ity theory which reveals that surprisingly strong interactions between waves and particles 
can lead to either wave damping or wave instability depending on the shape of the velocity 
distribution of the particles. Chapter 6 describes cold plasma waves in a background mag- 
netic field and discusses the Clemmow-Mullaly-Allis diagram, an elegant categorization 
scheme for the large number of qualitatively different types of cold plasma waves that exist 
in a magnetized plasma. Chapter 7 discusses certain additional subtle and practical aspects 
of wave propagation including propagation in an inhomogeneous plasma and how the en- 
ergy content of a wave is related to its dispersion relation. Chapter 8 begins by showing 
that the combination of warm plasma effects and a background magnetic field leads to the 
existence of the Bernstein wave, an altogether different kind of wave which has an infinite 
number of branches, and shows how a cold plasma wave can ‘mode convert’ into a Bern- 
stein wave in an inhomogeneous plasma. Chapter 8 concludes with a discussion of drift 
waves, ubiquitous low frequency waves which have important deleterious consequences 
for magnetic confinement. 

Chapters 9-12 provide a description of plasmas from the magnetohydrodynamic point 
of view. Chapter 9 begins by presenting several basic magnetohydrodynamic concepts 
(vacuum and force-free fields, magnetic pressure and tension, frozen-in flux, and energy 
minimization) and then uses these concepts to develop an intuitive understanding for dy- 
namic behavior. Chapter 9 then discusses magnetohydrodynamic equilibria and derives the 
Grad-Shafranov equation, an equation which depends on the existence of symmetry and 
which characterizes three-dimensional magnetohydrodynamic equilibria. Chapter 9 ends 
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with a discussion on magnetohydrodynamic flows such as occur in arcs and jets. Chap- 
ter 10 examines the stability of perfectly conducting (i.e., ideal) magnetohydrodynamic 
equilibria, derives the ‘energy principle’ method for analyzing stability, discusses kink and 
sausage instabilities, and introduces the concepts of magnetic helicity and force-free equi- 
libria. Chapter 11 examines magnetic helicity from a topological point of view and shows 
how helicity conservation and energy minimization leads to the Woltjer-Taylor model for 
magnetohydrodynamic self-organization. Chapter 12 departs from the ideal models pre- 
sented earlier and discusses magnetic reconnection, a non-ideal behavior which permits 
the magnetohydrodynamic plasma to alter its topology and thereby relax to a minimum- 
energy state. 

Chapters 13-17 consist of various advanced topics. Chapter 13 considers collisions 
from a Fokker-Planck point of view and is essentially a revisiting of the issues in Chapter 
1 using a more sophisticated point of view; the Fokker-Planck model is used to derive a 
more accurate model for plasma electrical resistivity and also to show the failure of Ohm’s 
law when the electric field exceeds a critical value called the Dreicer limit. Chapter 14 
considers two manifestations of wave-particle nonlinearity: (i) quasi-linear velocity space 
diffusion due to weak turbulence and (ii) echoes, non-linear phenomena which validate the 
concepts underlying Landau damping. Chapter 15 discusses how nonlinear interactions en- 
able energy and momentum to be transferred between waves, categorizes the large number 
of such wave-wave nonlinear interactions, and shows how these various interactions are all 
based on a few fundamental concepts. Chapter 16 discusses one-component plasmas (pure 
electron or pure ion plasmas) and shows how these plasmas have behaviors differing from 
conventional two-component, electron-ion plasmas. Chapter 17 discusses dusty plasmas 
which are three component plasmas (electrons, ions, and dust grains) and shows how the 
addition of a third component also introduces new behaviors, including the possibility of 
the dusty plasma condensing into a crystal. The analysis of condensation involves revisit- 
ing the Debye shielding concept and so corresponds, in a sense to having the book end on 
the same note it started on. 

I would like to extend my grateful appreciation to Professor Michael Brown at 
Swarthmore College for providing helpful feedback obtained from using a draft version in 
a seminar course at Swarthmore and to Professor Roy Gould at Caltech for providing useful 
suggestions. I would also like to thank graduate students Deepak Kumar and Gunsu Yun for 
carefully scrutinizing the final drafts of the manuscript and pointing out both ambiguities 
in presentation and typographical errors. I would also like to thank the many students who, 
over the years, provided useful feedback on earlier drafts of this work when it was in the 
form of lecture notes. Finally, I would like to acknowledge and thank my own mentors and 
colleagues who have introduced me to the many fascinating ideas constituting the discipline 
of plasma physics and also the many scientists whose hard work over many decades has 
led to the development of this discipline. 


Paul M. Bellan 
Pasadena, California 
September 30, 2004 


Basic concepts 


1.1 History of the term “plasma” 


In the mid-19th century the Czech physiologist Jan Evangelista Purkinje introduced use 
of the Greek word plasma (meaning “formed or molded”) to denote the clear fluid which 
remains after removal of all the corpuscular material in blood. Half a century later, the 
American scientist Irving Langmuir proposed in 1922 that the electrons, ions and neutrals 
in an ionized gas could similarly be considered as corpuscular material entrained in some 
kind of fluid medium and called this entraining medium plasma. However it turned out that 
unlike blood where there really is a fluid medium carrying the corpuscular material, there 
actually is no “fluid medium” entraining the electrons, ions, and neutrals in an ionized gas. 
Ever since, plasma scientists have had to explain to friends and acquaintances that they 
were not studying blood! 


1.2 Brief history of plasma physics 


In the 1920’s and 1930’s a few isolated researchers, each motivated by a specific practi- 
cal problem, began the study of what is now called plasma physics. This work was mainly 
directed towards understanding (1) the effect of ionospheric plasma on long distance short- 
wave radio propagation and (ii) gaseous electron tubes used for rectification, switching 
and voltage regulation in the pre-semiconductor era of electronics. In the 1940’s Hannes 
Alfvén developed a theory of hydromagnetic waves (now called Alfvén waves) and pro- 
posed that these waves would be important in astrophysical plasmas. In the early 1950’s 
large-scale plasma physics based magnetic fusion energy research started simultaneously 
in the USA, Britain and the then Soviet Union. Since this work was an offshoot of ther- 
monuclear weapon research, it was initially classified but because of scant progress in each 
country’s effort and the realization that controlled fusion research was unlikely to be of mil- 
itary value, all three countries declassified their efforts in 1958 and have cooperated since. 
Many other countries now participate in fusion research as well. 

Fusion progress was slow through most of the 1960’s, but by the end of that decade the 
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empirically developed Russian tokamak configuration began producing plasmas with pa- 
rameters far better than the lackluster results of the previous two decades. By the 1970’s 
and 80’s many tokamaks with progressively improved performance were constructed and 
at the end of the 20th century fusion break-even had nearly been achieved in tokamaks. 
International agreement was reached in the early 21st century to build the International 
Thermonuclear Experimental Reactor (ITER), a break-even tokamak designed to produce 
500 megawatts of fusion output power. Non-tokamak approaches to fusion have also been 
pursued with varying degrees of success; many involve magnetic confinement schemes 
related to that used in tokamaks. In contrast to fusion schemes based on magnetic con- 
finement, inertial confinement schemes were also developed in which high power lasers or 
similarly intense power sources bombard millimeter diameter pellets of thermonuclear fuel 
with ultra-short, extremely powerful pulses of strongly focused directed energy. The in- 
tense incident power causes the pellet surface to ablate and in so doing, act like a rocket 
exhaust pointing radially outwards from the pellet. The resulting radially inwards force 
compresses the pellet adiabatically, making it both denser and hotter; with sufficient adia- 
batic compression, fusion ignition conditions are predicted to be achieved. 

Simultaneous with the fusion effort, there has been an equally important and extensive 
study of space plasmas. Measurements of near-Earth space plasmas such as the aurora 
and the ionosphere have been obtained by ground-based instruments since the late 19th 
century. Space plasma research was greatly stimulated when it became possible to use 
spacecraft to make routine in situ plasma measurements of the Earth’s magnetosphere, the 
solar wind, and the magnetospheres of other planets. Additional interest has resulted from 
ground-based and spacecraft measurements of topologically complex, dramatic structures 
sometimes having explosive dynamics in the solar corona. Using radio telescopes, optical 
telescopes, Very Long Baseline Interferometry and most recently the Hubble and Spitzer 
spacecraft, large numbers of astrophysical jets shooting out from magnetized objects such 
as stars, active galactic nuclei, and black holes have been observed. Space plasmas often 
behave in a manner qualitatively similar to laboratory plasmas, but have a much grander 
scale. 

Since the 1960’s an important effort has been directed towards using plasmas for space 
propulsion. Plasma thrusters have been developed ranging from small ion thrusters for 
spacecraft attitude correction to powerful magnetoplasmadynamic thrusters that -given an 
adequate power supply — could be used for interplanetary missions. Plasma thrusters are 
now in use on some spacecraft and are under serious consideration for new and more am- 
bitious spacecraft designs. 

Starting in the late 1980’s a new application of plasma physics appeared — plasma 
processing — a critical aspect of the fabrication of the tiny, complex integrated circuits 
used in modern electronic devices. This application is now of great economic importance. 

In the 1990’s studies began on dusty plasmas. Dust grains immersed in a plasma can 
become electrically charged and then act as an additional charged particle species. Be- 
cause dust grains are massive compared to electrons or ions and can be charged to varying 
amounts, new physical behavior occurs that is sometimes an extension of what happens 
in a regular plasma and sometimes altogether new. In the 1980’s and 90’s there has also 
been investigation of non-neutral plasmas, these mimic the equations of incompressible 
hydrodynamics and so provide a compelling analog computer for problems in incompress- 
ible hydrodynamics. Both dusty plasmas and non-neutral plasmas can also form bizarre 
strongly coupled collective states where the plasma resembles a solid (e.g., forms quasi- 
crystalline structures). Another application of non-neutral plasmas is as a means to store 
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large quantities of positrons. 

In addition to the above activities there have been continuing investigations of indus- 
trially relevant plasmas such as arcs, plasma torches, and laser plasmas. In particular, 
approximately 40% of the steel manufactured in the United States is recycled in huge elec- 
tric arc furnaces capable of melting over 100 tons of scrap steel in a few minutes. Plasma 
displays are used for flat panel televisions and of course there are naturally-occurring ter- 
restrial plasmas such as lightning. 


1.3 Plasma parameters 


Three fundamental parameters! characterize a plasma: 
1. the particle density n (measured in particles per cubic meter), 


2. the temperature T of each species (usually measured in eV, where 1 eV=11,605 K), 


3. the steady state magnetic field B (measured in Tesla). 

A host of subsidiary parameters (e.g., Debye length, Larmor radius, plasma frequency, 
cyclotron frequency, thermal velocity) can be derived from these three fundamental para- 
meters. For partially-ionized plasmas, the fractional ionization and cross-sections of neu- 
trals are also important. 


1.4 Examples of plasmas 


1.4.1 Non-fusion terrestrial plasmas 


It takes considerable resources and skill to make a hot, fully ionized plasma and so, ex- 
cept for the specialized fusion plasmas, most terrestrial plasmas (e.g., arcs, neon signs, 
fluorescent lamps, processing plasmas, welding arcs, and lightning) have electron tem- 
peratures of a few eV, and for reasons given later, have ion temperatures that are colder, 
often at room temperature. These ‘everyday’ plasmas usually have no imposed steady state 
magnetic field and do not produce significant self magnetic fields. Typically, these plas- 
mas are weakly ionized and dominated by collisional and radiative processes. Densities in 
these plasmas range from 10!* to 10?2m~? (for comparison, the density of air at STP is 
2.7 x 1075m~?). 


1.4.2 Fusion-grade terrestrial plasmas 


Using carefully designed, expensive, and often large plasma confinement systems together 
with high heating power and obsessive attention to purity, fusion researchers have suc- 
ceeded in creating fully ionized hydrogen or deuterium plasmas which attain temperatures 


TIn older plasma literature, density and magnetic fields are often expressed in cgs units, i.e., densities are given 
in particles per cubic centimeter, and magnetic fields are given in Gauss. Since the 1990’s there has been general 
agreement to use SI units when possible. SI units have the distinct advantage that electrical units are in terms of 
familiar quantities such as amps, volts, and ohms and so a model prediction in SI units can much more easily be 
compared to the results of an experiment than a prediction given in cgs units. 
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in the range from 10’s of eV to tens of thousands of eV. In typical magnetic confinement 
devices (e.g., tokamaks, stellarators, reversed field pinches, mirror devices) an externally 
produced 1-10 Tesla magnetic field of carefully chosen geometry is imposed on the plasma. 
Magnetic confinement devices generally have densities in the range 101° — 10?!m~°. Plas- 
mas used in inertial fusion are much more dense; the goal is to attain for a brief instant 
densities one or two orders of magnitude larger than solid density (~ 10?’m~?), 


1.4.3 Space plasmas 


The parameters of these plasmas cover an enormous range. For example the density of 
space plasmas vary from 10° m~? in interstellar space, to 107° m~? in the solar atmosphere. 
Most of the astrophysical plasmas that have been investigated have temperatures in the 
range of 1-100 eV and these plasmas are usually fully ionized. 


1.5 Logical framework of plasma physics 


Plasmas are complex and exist in a wide variety of situations differing by many orders of 
magnitude. An important situation where plasmas do not normally exist is ordinary human 
experience. Consequently, people do not have the sort of intuition for plasma behavior that 
they have for solids, liquids or gases. Although plasma behavior seems non- or counter- 
intuitive at first, with suitable effort a good intuition for plasma behavior can be developed. 
This intuition can be helpful for making initial predictions about plasma behavior in a 
new situation, because plasmas have the remarkable property of being extremely scalable; 
i.e., the same qualitative phenomena often occur in plasmas differing by many orders of 
magnitude. Plasma physics is usually not a precise science. It is rather a web of overlapping 
points of view, each modeling a limited range of behavior. Understanding of plasmas is 
developed by studying these various points of view, all the while keeping in mind the 
linkages between the points of view. 


Lorentz equation 


(gives xj, vj for each particle from knowledge of E(x, t), B(x, f)) 


Maxwell equations 


(gives E(x,t), B(x, t) from knowledge of x;, v; for each particle) 
Figure 1.1: Interrelation between Maxwell’s equations and the Lorentz equation 


Plasma dynamics is determined by the self-consistent interaction between electromag- 
netic fields and statistically large numbers of charged particles as shown schematically in 
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Fig. 1.1. In principle, the time evolution of a plasma can be calculated as follows: 


1. given the trajectory x;(t) and velocity v;(t) of each and every particle j, the electric 
field E(x,t) and magnetic field B(x,t) can be evaluated using Maxwell’s equations, 


and simultaneously, 


2. given the instantaneous electric and magnetic fields E(x,t) and B(x,t), the forces on 
each and every particle 7 can be evaluated using the Lorentz equation and then used 
to update the trajectory x,(¢) and velocity v,(¢) of each particle. 

While this approach is conceptually easy to understand, it is normally impractical to im- 
plement because of the extremely large number of particles and to a lesser extent, because 
of the complexity of the electromagnetic field. To gain a practical understanding, we there- 
fore do not attempt to evaluate the entire complex behavior all at once but, instead, study 
plasmas by considering specific phenomena. For each phenomenon under immediate con- 
sideration, appropriate simplifying approximations are made, leading to a more tractable 
problem and hopefully revealing the essence of what is going on. A situation where a cer- 
tain set of approximations is valid and provides a self-consistent description is called a 
regime. There are a number of general categories of simplifying approximations, namely: 


1. Approximations involving the electromagnetic field: 


(a) assuming the magnetic field is zero (unmagnetized plasma) 


(b) assuming there are no inductive electric fields (electrostatic approximation) 


(c) neglecting the displacement current in Ampere’s law (suitable for phenomena 
having characteristic velocities much slower than the speed of light) 


(d) assuming that all magnetic fields are produced by conductors external to the 
plasma 


(e) various assumptions regarding geometric symmetry (e.g., spatially uniform, uni- 
form in a particular direction, azimuthally symmetric about an axis) 


2. Approximations involving the particle description: 


(a) averaging of the Lorentz force over some sub-group of particles: 


1. 


ii. 


iii. 


Vlasov theory: average over all particles of a given species (electrons or 
ions) having the same velocity at a given location and characterize the 
plasma using the distribution function f,(x,v,t) which gives the density 
of particles of species ø having velocity v at position x at time t 

two-fluid theory: average velocities over all particles of a given species 
at a given location and characterize the plasma using the species density 
nN, (x,t), mean velocity us (x, t), and pressure P, (x,t) defined relative to 
the species mean velocity 

magnetohydrodynamic theory: average momentum over all particles of all 
species and characterize the plasma using the center of mass density p(x, t), 
center of mass velocity U(x, t), and pressure P(x, t) defined relative to the 
center of mass velocity 


(b) assumptions about time (e.g., assume the phenomenon under consideration is 
fast or slow compared to some characteristic frequency of the particles such as 
the cyclotron frequency) 
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(c) assumptions about space (e.g., assume the scale length of the phenomenon under 
consideration is large or small compared to some characteristic plasma length 
such as the cyclotron radius) 


(d) assumptions about velocity (e.g., assume the phenomenon under consideration 
is fast or slow compared to the thermal velocity vr, of a particular species o) 


The large number of possible permutations and combinations that can be constructed 
from the above list means that there will be a large number of regimes. Since developing an 
intuitive understanding requires making approximations of the sort listed above and since 
these approximations lack an obvious hierarchy, it is not clear where to begin. In fact, 
as sketched in Fig.1.2, the models for particle motion (Vlasov, 2-fluid, MHD) involve a 
circular argument. Wherever we start on this circle, we are always forced to take at least 
one new concept on trust and hope that its validity will be established later. The reader is 
encouraged to refer to Fig.1.2 as its various components are examined so that the logic of 
this circle will eventually become clear. 


slow phenomena fast phenomena 


ieldi lasma oscillations 
Rutherford scattering Debye shielding p $ 


random walk ee 


nearly collisionless 
nature of plasmas 


Vlasov equation 


magnetohydrodynamics two-fluid equations 


Figure 1.2: Hierarchy of models of plasmas showing circular nature of logic. 


Because the argument is circular, the starting point is at the author’s discretion, and for 
good (but not overwhelming reasons), this author has decided that the optimum starting 
point on Fig. 1.2 is the subject of Debye shielding. Debye concepts, the Rutherford model 
for how charged particles scatter from each other, and some elementary statistics will be 
combined to construct an argument showing that plasmas are weakly collisional. We will 
then discuss phase-space concepts and introduce the Vlasov equation for the phase-space 
density. Averages of the Vlasov equation will provide two-fluid equations and also the 
magnetohydrodynamic (MHD) equations. Having established this framework, we will then 
return to study features of these points of view in more detail, often tying up loose ends that 
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occurred in our initial derivation of the framework. Somewhat separate from the study of 
Vlasov, two-fluid and MHD equations (which all attempt to give a self-consistent picture of 
the plasma) is the study of single particle orbits in prescribed fields. This provides useful 
intuition on the behavior of a typical particle in a plasma, and can provide important inputs 
or constraints for the self-consistent theories. 


1.6 Debye shielding 


We begin our study of plasmas by examining Debye shielding, a concept originating from 
the theory of liquid electrolytes (Debye and Huckel 1923). Consider a finite-temperature 
plasma consisting of a statistically large number of electrons and ions and assume that the 
ion and electron densities are initially equal and spatially uniform. As will be seen later, 
the ions and electrons need not be in thermal equilibrium with each other, and so the ions 
and electrons will be allowed to have separate temperatures denoted by 7;, Te. 

Since the ions and electrons have random thermal motion, thermally induced perturba- 
tions about the equilibrium will cause small, transient spatial variations of the electrostatic 
potential ¢. In the spirit of circular argument the following assumptions are now invoked 
without proof: 

1. The plasma is assumed to be nearly collisionless so that collisions between particles 
may be neglected to first approximation. 


2. Each species, denoted as ø, may be considered as a ‘fluid’ having a density no, a 
temperature T,, a pressure P, = nT, (K is Boltzmann’s constant), and a mean 
velocity u, so that the collisionless equation of motion for each fluid is 


1 
=g,E—-— —VP, 1.1 
yi ee (1.1) 


where mo is the particle mass, qo is the charge of a particle, and E is the electric field. 
Now consider a perturbation with a sufficiently slow time dependence to allow the fol- 
lowing assumptions: 
1. The inertial term ~ d/dt on the left hand side of Eq.(1.1) is negligible and may be 
dropped. 


2. Inductive electric fields are negligible so the electric field is almost entirely electrosta- 
tic, i.e., E ~ —Vo. 


3. All temperature gradients are smeared out by thermal particle motion so that the tem- 
perature of each species is spatially uniform. 


4. The plasma remains in thermal equilibrium throughout the perturbation (i.e., can al- 
ways be characterized by a temperature). 
Invoking these approximations, Eq.(1.1) reduces to 


0 Nole V h — KT VNo, (1.2) 


a simple balance between the force due to the electrostatic electric field and the force due 
to the isothermal pressure gradient. Equation (1.2) is readily solved to give the Boltzmann 
relation 

Ng = noo €XP(—ab/KT) (1.3) 
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where noo is a constant. It is important to emphasize that the Boltzmann relation results 
from the assumption that the perturbation is very slow; if this is not the case, then inertial 
effects, inductive electric fields, or temperature gradient effects will cause the plasma to 
have a completely different behavior from the Boltzmann relation. Situations exist where 
this ‘slowness’ assumption is valid for electron dynamics but not for ion dynamics, in 
which case the Boltzmann condition will apply only to the electrons but not to the ions 
(the converse situation does not normally occur, because ions, being heavier, are always 
more sluggish than electrons and so it is only possible for a phenomena to appear slow to 
electrons but not to ions). 

Let us now imagine slowly inserting a single additional particle (so-called “test” par- 
ticle) with charge qr into an initially unperturbed, spatially uniform neutral plasma. To 
keep the algebra simple, we define the origin of our coordinate system to be at the location 
of the test particle. Before insertion of the test particle, the plasma potential was ¢ = 0 
everywhere because the ion and electron densities were spatially uniform and equal, but 
now the ions and electrons will be perturbed because of their interaction with the test par- 
ticle. Particles having the same polarity as qr will be slightly repelled whereas particles of 
opposite polarity will be slightly attracted. The slight displacements resulting from these 
repulsions and attractions will result in a small, but finite potential in the plasma. This po- 
tential will be the superposition of the test particle’s own potential and the potential of the 
plasma particles that have moved slightly in response to the test particle. 

This slight displacement of plasma particles is called shielding or screening of the test 
particle because the displacement tends to reduce the effectiveness of the test particle field. 
To see this, suppose the test particle is a positively charged ion. When immersed in the 
plasma it will attract nearby electrons and repel nearby ions; the net result is an effectively 
negative charge cloud surrounding the test particle. An observer located far from the test 
particle and its surrounding cloud would see the combined potential of the test particle and 
its associated cloud. Because the cloud has the opposite polarity of the test particle, the 
cloud potential will partially cancel (1.e., shield or screen) the test particle potential. 

Screening is calculated using Poisson’s equation with the source terms being the test 
particle and its associated cloud. The cloud contribution is determined using the Boltz- 
mann relation for the particles that participate in the screening. This is a ‘self-consistent’ 
calculation for the potential because the shielding cloud is affected by its self-potential. 

Thus, Poisson’s equation becomes 


1 
V?o = E qro(r) + Dol) do (1.4) 


where the term qrô(r) on the right hand side represents the charge density due to the test 
particle and the term 5*n,(r)qo represents the charge density of all plasma particles that 
participate in the screening (i.e., everything except the test particle). Before the test particle 
was inserted X` —; e no(r)qo vanished because the plasma was assumed to be initially 
neutral. 

Since the test particle was inserted slowly, the plasma response will be Boltzmann-like 
and we may substitute for n,(r) using Eq.(1.3). Furthermore, because the perturbation 
due to a single test particle is infinitesimal, we can safely assume that |qoġ| << «KT;, in 
which case Eq.(1.3) becomes simply ng œ% noo(l — dc¢/KT,). The assumption of initial 
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neutrality means that }>,_; e nooo = O causing the terms independent of ¢ to cancel in 
Eq.(1.4) which thus reduces to 


c 1 qT 
V’ - — p =- 0(r 1.5 
$- =— Fae) (15) 
where the effective Debye length is defined by 
1 1 
= > (1.6) 
BAe 
and the species Debye length As is 
To 
a2 = Oe (1.7) 
Nool 


The second term on the left hand side of Eq.(1.5) is just the negative of the shielding 
cloud charge density. The summation in Eq.(1.6) is over all species that participate in the 
shielding. Since ions cannot move fast enough to keep up with an electron test charge 
which would be moving at the nominal electron thermal velocity, the shielding of electrons 
is only by other electrons, whereas the shielding of ions is by both ions and electrons. 

Equation (1.5) can be solved using standard mathematical techniques (cf. assignments) 
to give 

—_ IT -r/Ap 

or) = rÀ RB (1.8) 
For r << Ap the potential ¢(r) is identical to the potential of a test particle in vacuum 
whereas for r >> Ap the test charge is completely screened by its surrounding shielding 
cloud. The nominal radius of the shielding cloud is Ap. Because the test particle is com- 
pletely screened for r >> Ap, the total shielding cloud charge is equal in magnitude to the 
charge on the test particle and opposite in sign. This test-particle/shielding-cloud analy- 
sis makes sense only if there is a macroscopically large number of plasma particles in the 
shielding cloud; i.e., the analysis makes sense only if Anno X3, /3 >> 1. This will be seen 
later to be the condition for the plasma to be nearly collisionless and so validate assumption 
#1 in Sec.1.6. 

In order for shielding to be a relevant issue, the Debye length must be small compared 
to the overall dimensions of the plasma, because otherwise no point in the plasma could be 
outside the shielding cloud. Finally, it should be realized that any particle could have been 
construed as being ‘the’ test particle and so we conclude that the time-averaged effective 
potential of any selected particle in the plasma is given by Eq. (1.8) (from a statistical point 
of view, selecting a particle means that it no longer is assumed to have a random thermal 
velocity and its effective potential is due to its own charge and to the time average of the 
random motions of the other particles). 


1.7 Quasi-neutrality 


The Debye shielding analysis above assumed that the plasma was initially neutral, i.e., that 
the initial electron and ion densities were equal. We now demonstrate that if the Debye 
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length is a microscopic length, then it is indeed an excellent assumption that plasmas re- 
main extremely close to neutrality, while not being exactly neutral. It is found that the 
electrostatic electric field associated with any reasonable configuration is easily produced 
by having only a tiny deviation from perfect neutrality. This tendency to be quasi-neutral 
occurs because a conventional plasma does not have sufficient internal energy to become 
substantially non-neutral for distances greater than a Debye length (there do exist non- 
neutral plasmas which violate this concept, but these involve rotation of plasma in a back- 
ground magnetic field which effectively plays the neutralizing role of ions in a conventional 
plasma). 

To prove the assertion that plasmas tend to be quasi-neutral, we consider an initially 
neutral plasma with temperature T and calculate the largest radius sphere that could spon- 
taneously become depleted of electrons due to thermal fluctuations. Let fmax be the radius 
of this presumed sphere. Complete depletion (i.e., maximum non-neutrality) would occur 
if a random thermal fluctuation caused all the electrons originally in the sphere to vacate 
the volume of the sphere and move to its surface. The electrons would have to come to rest 
on the surface of the presumed sphere because if they did not, they would still have avail- 
able kinetic energy which could be used to move out to an even larger radius, violating the 
assumption that the sphere was the largest radius sphere which could become fully depleted 
of electrons. This situation is of course extremely artificial and likely to be so rare as to be 
essentially negligible because it requires all the electrons to be moving radially relative to 
some origin. In reality, the electrons would be moving in random directions. 

When the electrons exit the sphere they leave behind an equal number of ions. The 
remnant ions produce a radial electric field which pulls the electrons back towards the 
center of the sphere. One way of calculating the energy stored in this system is to calculate 
the work done by the electrons as they leave the sphere and collect on the surface, but a 
simpler way is to calculate the energy stored in the electrostatic electric field produced by 
the ions remaining in the sphere. This electrostatic energy did not exist when the electrons 
were initially in the sphere and balanced the ion charge and so it must be equivalent to the 
work done by the electrons on leaving the sphere. 

The energy density of an electric field is <9 Æ? /2 and because of the spherical symmetry 
assumed here the electric field produced by the remnant ions must be in the radial direction. 
The ion charge in a sphere of radius r is Q = 4mner? /3 and so after all the electrons have 
vacated the sphere, the electric field at radius r is E, = Q/4reor? = ner/3e9. Thus 
the energy stored in the electrostatic field resulting from complete lack of neutralization of 
ions in a sphere of radius fmax 1S 


Tmax F2 2 22 
W = J OE er dr = rr’ vee 
0 


; 1. 
Daa, (19) 


Equating this potential energy to the initial electron thermal kinetic energy Wyinetic 
gives 


Qnze2 3 4 
TT ig re = grat x atts (1.10) 
which may be solved to give 
T 
rex = 4528 (1.11) 


Nee? 
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so that Trax S TAp. 

Thus, the largest spherical volume that could spontaneously become fully depleted of 
electrons has a radius of a few Debye lengths, but this would require the highly unlikely 
situation of having all the electrons initially moving in the outward radial direction. We 
conclude that the plasma is quasi-neutral over scale lengths much larger than the Debye 
length. When a biased electrode such as a wire probe is inserted into a plasma, the plasma 
screens the field due to the potential on the electrode in the same way that the test charge 
potential was screened. The screening region is called the sheath, which is a region of 
non-neutrality having an extent of the order of a Debye length. 


1.8 Small v. large angle collisions in plasmas 


We now consider what happens to the momentum and energy of a test particle of charge 
qr and mass mr that is injected with velocity vr into a plasma. This test particle will 
make a sequence of random collisions with the plasma particles (called “field” particles 
and denoted by subscript F’); these collisions will alter both the momentum and energy of 
the test particle. 


m/2 scattering 


small 
angle 
scattering 


differential cross section 2abdb 


cross section 2b2,, for small angle scattering 


for large angle scattering 


Figure 1.3: Differential scattering cross sections for large and small deflections 


Solution of the Rutherford scattering problem in the center of mass frame shows (see 
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assignment 1, this chapter) that the scattering angle 0 is given by 


Er (5) __ 7g Coulomb interaction energy (1.12) 


2)  Aneqbuve kinetic energy 


where pp! = ma +m a is the reduced mass, b is the impact parameter, and vo is the 


initial relative velocity. It is useful to separate scattering events (i.e., collisions) into two 
approximate categories, namely (1) large angle collisions where 7/2 < 6 < 7 and (2) 
small angle (grazing) collisions where 0 << 1/2. 

Let us denote b,,/2 as the impact parameter for 90 degree collisions; from Eq.(1.12) this 
is 

On /2 = Tene (1.13) 

and is the radius of the inner (small) shaded circle in Fig.1.3. Large angle scatterings will 
occur if the test particle is incident anywhere within this circle and so the total cross section 
for all large angle collisions is 


aS 2 
Olarge ~ 17/2 


2 
= (2 z) l (1.14) 
Anequve 

Grazing (small angle) collisions occur when the test particle impinges outside the shaded 
circle and so occur much more frequently than large angle collisions. Although each graz- 
ing collision does not scatter the test particle by much, there are far more grazing collisions 
than large angle collisions and so it is important to compare the cumulative effect of graz- 
ing collisions with the cumulative effect of large angle collisions. 

To make matters even more complicated, the effective cross-section of grazing colli- 
sions depends on impact parameter, since the larger b is, the smaller the scattering. To take 
this weighting of impact parameters into account, the area outside the shaded circle is sub- 
divided into a set of concentric annuli, called differential cross-sections. If the test particle 
impinges on the differential cross-section having radii between b and b + db, then the test 
particle will be scattered by an angle lying between 0(b) and 0(b + db) as determined by 
Eq.(1.12). The area of the differential cross-section is 271bdb which is therefore the effec- 
tive cross-section for scattering between 0(b) and 6(b + db). Because the azimuthal angle 
about the direction of incidence is random, the simple average of N small angle scatterings 
vanishes, i.e., N~* D 0; = 0 where 0; is the scattering due to the it? collision and N 
is a large number. 

Random walk statistics must therefore be used to describe the cumulative effect of 
small angle scatterings and so we will use the square of the scattering angle, i.e. 07, as the 
quantity for comparing the cumulative effects of small (grazing) and large angle collisions. 
Thus, scattering is a diffusive process. 

To compare the respective cumulative effects of grazing and large angle collisions we 
calculate how many small angle scatterings must occur to be equivalent to a single large 
angle scattering (1.e. Oras e ~ 1); here we pick the nominal value of the large angle scat- 


tering to be 1 radian. In other words, we ask what must NV be in order to have D x1 


where each 0; represents an individual small angle scattering event. Equivalently, we may 
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ask what time t do we have to wait for the cumulative effect of the grazing collisions on a 
test particle to give an effective scattering equivalent to a single large angle scattering? 

To calculate this, let us imagine we are “sitting” on the test particle. In this test particle 
frame the field particles approach the test particle with the velocity v-e; and so the apparent 
flux of field particles is T = n pUre, where vre; is the relative velocity between the test and 
field particles. The number of small angle scattering events in time t for impact parameters 
between b and b + db is [¢27bdb and so the time required for the cumulative effect of small 
angle collisions to be equivalent to a large angle collision is given by 


N 
1x = ref zra. (1.15) 
i=1 


The definitions of scattering theory show (see assignment 9) that oT = t~! where ø is the 
cross section for an event and t is the time one has to wait for the event to occur. Substituting 
for Tt in Eq.(1.15) gives the cross-section o* for the cumulative effect of grazing collisions 
to be equivalent to a single large angle scattering event, 


C= J mw. (1.16) 


The appropriate lower limit for the integral in Eq.(1.16) is 0, /2, since impact parameters 
smaller than this value produce large angle collisions. What should the upper limit of the 
integral be? We recall from our Debye discussion that the field of the scattering center 
is screened out for distances greater than Ap. Hence, small angle collisions occur only 
for impact parameters in the range b;/;2 < b < Ap because the scattering potential is 
non-existent for distances larger than Ap. 

For small angle collisions, Eq.(1.12) gives 


qT IF 
0b) = =. 1.17 
(0) QrEeouved ery) 
so that Eq. (1.7.3) becomes 
àD 2 
o =) 2rbdb (5) (1.18) 
Beis 2TEQLVAb 
or P 
o* = sin ( P ) gige: (1.19) 
On /2 


Thus, if Ap /br /2 >> 1 the cross section o* will significantly exceed Ojarge. Since 
bn /2 = 1/2n3,, the condition Ap >> 0, /2 is equivalent to nA >> 1, which is just 
the criterion for there to be a large number of particles in a sphere having radius Ap (a 
so-called Debye sphere). This was the condition for the Debye shielding cloud argument to 
make sense. We conclude that the criterion for an ionized gas to behave as a plasma (i.e., 
Debye shielding is important and grazing collisions dominate large angle collisions) is the 
condition that nà3 >> 1. For most plasmas HN is a large number with natural logarithm 
of order 10; typically, when making rough estimates of o*, one uses In(Ap/by/2) © 10. 
The reader may have developed a concern about the seeming arbitrary nature of the choice 
of bz/2 as the “dividing line’ between large angle and grazing collisions. This arbitrariness 
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is of no consequence since the logarithmic dependence means that any other choice having 
the same order of magnitude for the ‘dividing line’ would give essentially the same result. 
By substituting for 0,,/2 the cross section can be re-written as 


2 
1 ; X 
Pace in ety (1.20) 
27m \ EoLvg On/2 


Thus, o* decreases approximately as the fourth power of the relative velocity. In a hot 
plasma where vo is large, o* will be very small and so scattering by Coulomb collisions is 
often much less important than other phenomena. A useful way to decide whether Coulomb 
collisions are important is to compare the collision frequency v = o* nv with the frequency 
of other effects, or equivalently the mean free path of collisions lmfp = 1/o*n with the 
characteristic length of other effects. If the collision frequency is small, or the mean free 
path is large (in comparison to other effects) collisions may be neglected to first approx- 
imation, in which case the plasma under consideration is called a collisionless or “ideal” 
plasma. The effective Coulomb cross section o*and its related parameters v and lm fp can 
be used to evaluate transport properties such as electrical resistivity, mobility, and diffusion. 


1.9 Electron and ion collision frequencies 


One of the fundamental physical constants influencing plasma behavior is the ion to elec- 
tron mass ratio. The large value of this ratio often causes electrons and ions to experience 
qualitatively distinct dynamics. In some situations, one species may determine the essen- 
tial character of a particular plasma behavior while the other species has little or no effect. 
Let us now examine how mass ratio affects: 

1. Momentum change (scattering) of a given incident particle due to collision between 


(a) like particles (1.e., electron-electron or ion-ion collisions, denoted ee or 22), 


(b) unlike particles (i.e., electrons scattering from ions denoted et or ions scattering 
from electrons denoted ie), 


2. Kinetic energy change (scattering) of a given incident particle due to collisions be- 
tween like or unlike particles. 

Momentum scattering is characterized by the time required for collisions to deflect the 
incident particle by an angle 7/2 from its initial direction, or more commonly, by the 
inverse of this time, called the collision frequency. The momentum scattering collision 
frequencies are denoted as Vee, Vii, Vei, Vie for the various possible interactions between 
species and the corresponding times as Tee, etc. Energy scattering is characterized by the 
time required for an incident particle to transfer all its kinetic energy to the target particle. 
Energy transfer collision frequencies are denoted respectively by V zee, VE ii, VEei, VE ie- 

We now show that these frequencies separate into categories having three distinct orders 
of magnitude having relative scalings 1 : (m;/m,)!/? : m;/me. In order to estimate the 
orders of magnitude of the collision frequencies we assume the incident particle is ‘typical’ 
for its species and so take its incident velocity to be the species thermal velocity vr, = 
(2kT, /m,)'/?. While this is reasonable for a rough estimate, it should be realized that, 
because of the v-* dependence in o*, a more careful averaging over all particles in the 
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thermal distribution will differ somewhat. This careful averaging is rather involved and 
will be deferred to Chapter 13. 

We normalize all collision frequencies to Vee, and for further simplification assume that 
the ion and electron temperatures are of the same order of magnitude. First consider vei: the 
reduced mass for eż collisions is the same as for ee collisions (except for a factor of 2 which 
we neglect), the relative velocity is the same — hence, we conclude that Ve; ~ vee. Now 
consider v;;: because the temperatures were assumed equal, o% ~ až, and so the collision 
frequencies will differ only because of the different velocities in the expression v = nov. 
The ion thermal velocity is lower by an amount (Mme /m;) 1/2 giving Vi; = (Me /mj)"/? Vee. 

Care is required when calculating Vie. Strictly speaking, this calculation should be done 
in the center of mass frame and then transformed back to the lab frame, but an easy way 
to estimate Vie using lab-frame calculations is to note that momentum is conserved in a 
collision so that in the lab frame m;Av; = —m-Av. where A means the change in a 
quantity as a result of the collision. If the collision of an ion head-on with a stationary 
electron is taken as an example, then the electron bounces off forward with twice the ion’s 
velocity (corresponding to a specular reflection of the electron in a frame where the ion 
is stationary); this gives Av. = 2v; and |Av;| /|v;| = 2m./m,;.Thus, in order to have 
|Av,| /|v;| of order unity, it is necessary to have m; /me head-on collisions of an ion with 
electrons whereas in order to have |Ave| / |ve] of order unity it is only necessary to have 
one collision of an electron with an ion. Hence Vie ~ (Me/Mj)Vec. 

Now consider energy changes in collisions. If a moving electron makes a head-on 
collision with an electron at rest, then the incident electron stops (loses all its momentum 
and energy) while the originally stationary electron flies off with the same momentum and 
energy that the incident electron had. A similar picture holds for an ion hitting an ion. 
Thus, like-particle collisions transfer energy at the same rate as momentum SO V Zee ~ Vee 
and VEii ~ Vii. 

Inter-species collisions are more complicated. Consider an electron hitting a stationary 
ion head-on. Because the ion is massive, it barely recoils and the electron reflects with a 
velocity nearly equal in magnitude to its incident velocity. Thus, the change in electron 
momentum is —2MeVe. From conservation of momentum, the momentum of the recoiling 
ion must be m;v; = 2MeVe. The energy transferred to the ion in this collision is miv? (2 
A(m./m;)m_v2/2. Thus, an electron has to make ~ m;/m- such collisions in order to 
transfer all its energy to ions. Hence, V gei = (Me/Mi)Vec- 

Similarly, if an incident ion hits an electron at rest the electron will fly off with twice 
the incident ion velocity (in the center of mass frame, the electron is reflecting from the 
ion). The electron gains energy mev? /2 so that again ~mi; /Me collisions are required for 
the ion to transfer all its energy to electrons. 

We now summarize the orders of magnitudes of collision frequencies in the table below. 


~1 | ~(me/mj)!/?7 | ~me/m; 
Vee Vii Vie 

Vei VE V Eei 

V Eee V Bie 


Although collisions are typically unimportant for fast transient processes, they may 
eventually determine many properties of a given plasma. The wide disparity of collision 
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frequencies shows that one has to be careful when determining which collisional process is 
relevant to a given phenomenon. Perhaps the best way to illustrate how collisions must be 
considered is by an example, such as the following: 

Suppose half the electrons in a plasma initially have a directed velocity vo while the 
other half of the electrons and all the ions are initially at rest. This may be thought of as a 
high density beam of electrons passing through a cold plasma. On the fast (i.e., Vee) time 
scale the beam electrons will: 

(1) collide with the stationary electrons and share their momentum and energy so that 
after a time of order vz the beam will become indistinguishable from the background 
electrons. Since momentum must be conserved, the combined electrons will have a mean 
velocity vo /2. 

(ii) collide with the stationary ions which will act as nearly fixed scattering centers so 
that the beam electrons will scatter in direction but not transfer significant energy to the 
ions. 

Both the above processes will randomize the velocity distribution of the electrons until 
this distribution becomes Maxwellian (the maximum entropy distribution); the Maxwellian 
will be centered about the average velocity discussed in (i) above. 

On the very slow v pge; time scale (down by a factor m;/m,) the electrons will trans- 
fer momentum to the ions, so on this time scale the electrons will share their momentum 
with the ions, in which case the electrons will slow down and the ions will speed up until 
eventually electrons and ions have the same momentum. Similarly the electrons will share 
energy with the ions in which case the ions will heat up while the electrons will cool. 

If, instead, a beam of ions were injected into the plasma, the ion beam would thermalize 
and share momentum with the background ions on the intermediate v;; time scale, and then 
only share momentum and energy with the electrons on the very slow v gie time scale. 

This collisional sharing of momentum and energy and thermalization of velocity dis- 
tribution functions to make Maxwellians is the process by which thermodynamic equilib- 
rium is achieved. Collision frequencies vary as T~*/? and so, for hot plasmas, collision 
processes are often slower than many other phenomena. Since collisions are the means by 
which thermodynamic equilibrium is achieved, plasmas are typically not in thermodynamic 
equilibrium, although some components of the plasma may be in a partial equilibrium (for 
example, the electrons may be in thermal equilibrium with each other but not with the ions). 
Hence, thermodynamically based descriptions of the plasma are often inappropriate. It is 
not unusual, for example, to have a plasma where the electron and ion temperatures dif- 
fer by more than an order of magnitude. This can occur when one species or the other 
has been subject to heating and the plasma lifetime is shorter than the interspecies energy 
equilibration time ~ Upa 


1.10 Collisions with neutrals 


If a plasma is weakly ionized then collisions with neutrals must be considered. These 
collisions differ fundamentally from collisions between charged particles because now the 
interaction forces are short-range (unlike the long-range Coulomb interaction) and so the 
neutral can be considered simply as a hard body with cross-section of the order of its actual 
geometrical size. All atoms have radii of the order of 10~!°m so the typical neutral cross 
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section is Cneu ~ 3 X 10720 m?. When a particle hits a neutral it can simply scatter with 
no change in the internal energy of the neutral; this is called elastic scattering. It can also 
transfer energy to the structure of the neutral and so cause an internal change in the neutral; 
this is called inelastic scattering. Inelastic scattering includes ionization and excitation of 
atomic level transitions (with accompanying optical radiation). 

Another process can occur when ions collide with neutrals — the incident ion can cap- 
ture an electron from the neutral and become neutralized while simultaneously ionizing the 
original neutral. This process, called charge exchange is used for producing energetic neu- 
tral beams. In this process a high energy beam of ions is injected into a gas of neutrals, 
captures electrons, and exits as a high energy beam of neutrals. 

Because ions have approximately the same mass as neutrals, ions rapidly exchange 
energy with neutrals and tend to be in thermal equilibrium with the neutrals if the plasma 
is weakly ionized. As a consequence, ions are typically cold in weakly ionized plasmas, 
because the neutrals are in thermal equilibrium with the walls of the container. 


1.11 Simple transport phenomena 


1. Electrical resistivity- When a uniform electric field E exists in a plasma, the electrons 
and ions are accelerated in opposite directions creating a relative momentum between 
the two species. At the same time electron-ion collisions dissipate this relative mo- 
mentum so it is possible to achieve a steady state where relative momentum creation 
(i.e., acceleration due to the E field) is balanced by relative momentum dissipation due 
to interspecies collisions (this dissipation of relative momentum is known as ‘drag’). 
The balance of forces on the electrons gives 

e 


0 = ——E — Veirel (1.21) 
Me 


since the drag is proportional to the relative velocity u,e; between electrons and ions. 


However, the electric current is just J = —n,eu,;¢) so that Eq.(1.21) can be re-written 
as 
E= 7J (1.22) 
where oe hey 
eee! 1.23 
ear (1.23) 


is the plasma electrical resistivity. Substituting ve; = o*n,vre and noting from quasi- 
neutrality that Zn; = Ne the plasma electrical resistivity is 


2 
n A in ( AD ) (1.24) 


a 27,3 
2TMeEVTe On /2 


from which we see that resistivity is independent of density, proportional to pees = 


and also proportional to the ion charge Z. This expression for the resistivity is only 
approximate since we did not properly average over the electron velocity distribution 
(a more accurate expression, differing by a factor of order unity, will be derived in 
Chapter 13). Resistivity resulting from grazing collisions between electrons and ions 
as given by Eq.(1.24) is known as Spitzer resistivity (Spitzer and Harm 1953). It 
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should be emphasized that although this discussion assumes existence of a uniform 
electric field in the plasma, a uniform field will not exist in what naively appears to 
be the most obvious geometry, namely a plasma between two parallel plates charged 
to different potentials. This is because Debye shielding will concentrate virtually all 
the potential drop into thin sheaths adjacent to the electrodes, resulting in near-zero 
electric field inside the plasma. A practical way to obtain a uniform electric field is to 
create the field by induction so that there are no electrodes that can be screened out. 


Diffusion and ambipolar diffusion- Standard random walk arguments show that parti- 
cle diffusion coefficients scale as D ~ (Az)? /T where Az is the characteristic step 
size in the random walk and 7 is the time between steps. This can also be expressed 
as D ~ v2 /v where v = T ' is the collision frequency and vr = Ax/t = vAz is 
the thermal velocity. Since the random step size for particle collisions is the mean free 
path and the time between steps is the inverse of the collision frequency, the electron 
diffusion coefficient in an unmagnetized plasma scales as 

D. = vel uve (1.25) 


Re T mav. 


where Ve = Vee + Vei ~ Vee is the 90° scattering rate for electrons and lm fpe = 
\/KT./mev2 is the electron mean free path. Similarly, the ion diffusion coefficient in 
an unmagnetized plasma is 


D; = vil; = (1.26) 


MV; 


where Vv; = Vii + Vie ~ Vi; is the effective ion collision frequency. The electron 
diffusion coefficient is typically much larger than the ion diffusion coefficient in an 
unmagnetized plasma (it is the other way around for diffusion across a magnetic field 
in a magnetized plasma where the step size is the Larmor radius). However, if the 
electrons in an unmagnetized plasma did in fact diffuse across a density gradient at 
a rate two orders of magnitude faster than the ions, the ions would be left behind 
and the plasma would no longer be quasi-neutral. What actually happens is that the 
electrons try to diffuse faster than the ions, but an electrostatic electric field is es- 
tablished which decelerates the electrons and accelerates the ions until the electron 
and ion fluxes become equalized. This results in an effective diffusion, called the 
ambipolar diffusion, which is less than the electron rate, but greater than the ion rate. 
Equation (1.21) shows that an electric field establishes an average electron momentum 
MeUe = —eE/ve where ve is the rate at which the average electron loses momen- 
tum due to collisions with ions or neutrals. Electron-electron collisions are excluded 
from this calculation because the average electron under consideration here cannot 
lose momentum due to collisions with other electrons, because the other electrons 
have on average the same momentum as this average electron. Since the electric field 
cannot impart momentum to the plasma as a whole, the momentum imparted to ions 
must be equal and opposite so m;u; = eE/ve. Because diffusion in the presence of 
a density gradient produces an electron flux —De Vne, the net electron flux resulting 
from both an electric field and a diffusion across a density gradient is 


Te = nee E—D:Vne (1.27) 
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where z 
He = — (1.28) 
MeVe 
is called the electron mobility. Similarly, the net ion flux is 
T; = Nipb;E—-DiV ing (1.29) 
where F 
= 1.30 
=a (1.30) 


is the ion mobility. In order to maintain quasineutrality, the electric field automatically 
adjusts itself to give Te = I; = Plambipolar and n; = Ne = n; this ambipolar electric 
field is 
(De — Di) 
(He — Mi) 
De 
~ — Vinn 


e 


To 
ze BAe nn (1.31) 
£ 


Eambipolar Vinn 


Substitution for E gives the ambipolar diffusion to be 


Di = De i 
T ambipolar = (4 zs ) Vn (1.32) 
Me Hi 


so the ambipolar diffusion coefficient is 


Dambipolar =e 


_ ETT) (1.33) 
MiVi 

where Eqs.(1.25) and (1.26) have been used as well as the relation v; ~ (me/m;)"?ve. 
If the electrons are much hotter than the ions, then for a given ion temperature, the 
ambipolar diffusion scales as Te /m;. The situation is a little like that of a small child 
tugging on his/her parent (the energy of the small child is like the electron temperature, 
the parental mass is like the ion mass, and the tension in the arm which accelerates 
the parent and decelerates the child is like the ambipolar electric field); the resulting 
motion (parent and child move together faster than the parent would like and slower 
than the child would like) is analogous to electrons being retarded and ions being ac- 
celerated by the ambipolar electric field in such a way as to maintain quasineutrality. 
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1.12 A quantitative perspective 


Relevant physical constants are 


e = 1.6 x 10-19 Coulombs 

me = 9.1 x 107?! kg 

Mp/Me = 1836 

£o = 8.85 x 1071? Farads/meter. 


The temperature is measured in units of electron volts, so that x = 1.6 x 107 19Joules/volt; 
i.e., k = e. Thus, the Debye length is 


EoKT 


AD = 
ne? 


= Eo ITev 
E e n 


7.4 x 10° 


To V 


meters. (1.34) 


We will assume that the typical velocity is related to the temperature by 


1 3 
3m = z“T- (1.35) 


For electron-electron scattering 4 = Me /2 so that the small angle scattering cross-section 
is 


a Ee z (à /b ) 
2T \ egmv?/2 peeps ee 

1 e? : 

Sie InA l. 
2T (Er) n (ldo) 


where 


AD 
bx /2 


leokT Amegmu? /2 
ne? e2 


= 6rnrd}3, (1.37) 


A = 


is typically a very large number corresponding to there being a macroscopically large num- 
ber of particles in a sphere having a radius equal to a Debye length; different authors will 
have slightly different numerical coefficients, depending on how they identify velocity with 
temperature. This difference is of no significance because one is taking the logarithm. 
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The collision frequency is v = o*nv so 


n e $ SKT A 
Vee = n 
2T \ 3coKT Me 
A e5/2 nln A 
2x 33/27e2me!? ihe 
InA 
= Äx 10 (1.38) 


eV 


Typically In A lies in the range 8-25 for most plasmas. 


Table 1.1 lists nominal parameters for several plasmas of interest and shows these plas- 
mas have an enormous range of densities, temperatures, scale lengths, mean free paths, and 
collision frequencies. The crucial issue is the ratio of the mean free path to the characteris- 
tic scale length. 


Arc plasmas and magnetoplasmadynamic thrusters are in the category of dense lab plas- 
mas; these plasmas are very collisional (the mean free path is much smaller than the char- 
acteristic scale length). The plasmas used in semiconductor processing and many research 
plasmas are in the diffuse lab plasma category; these plasmas are collisionless. It is possi- 
ble to make both collisional and collisionless lab plasmas, and in fact if there is are large 
temperature or density gradients it is possible to have both collisional and collisionless 
behavior in the same device. 


n |T | Abd mrp? | MA | vee | Imfp | L 
units m| eV |m s71 m m 
Solar corona 10 ] 100 | 1073 1107 19 | 102 10° | 108 
(loops) 
Solar wind 10’ | 10 | 10 10° 25 | 1075 | 10% | 10% 


(near earth) 
Magnetosphere | 10* | 10 | 107 101! | 28 | 1078 | 10/4 | 10° 
(tail lobe) 
Ionosphere 1024 | 0.1 | 107? | 104 14 | 10? 10° | 10° 
Mag. fusion | 10 | 107 | 10-* | 107 | 20 | 10" | 10" | 10 
(tokamak) 
Inertial fusion | 107! | 10¢ | 107t° | 102 8 10'* | 1077 | 1075 
(imploded) 


Lab plasma 107° | 5 10% | 10 | 9 108 1077 | 10°? 
(dense) 
Lab plasma 10%" | 5 1074 | 10° | 14 | 104 101 | 10-1 
(diffuse) 


Table 1.1: Comparison of parameters for a wide variety of plasmas 
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1.13 Assignments 


Figure 1.4: Geometry of scattering in center of mass frame. Scattering center is at the origin 
and @ is the scattering angle. Note symmetries of velocities before and after scattering. 


1. Rutherford Scattering: This assignment involves developing a derivation for Ruther- 
ford scattering which uses geometrical arguments to take advantage of the symmetry 
of the scattering trajectory. 


(a) Show that the equation of motion in the center of mass frame is 


dv ug . 

— Fi 

dt Ategr? 
The calculations will be done using the center of mass frame geometry shown in 
Fig.1.4 which consists of a cylindrical coordinate system r, ġ, z with origin at the 
scattering center. Let 0 be the scattering angle, and let b be the impact parameter 
as indicated in Fig.1.4. Also, define a Cartesian coordinate system x, y so that 
y = r sin ¢ etc.; these Cartesian coordinates are also shown in Fig.1.4. 


(b) By taking the time derivative of r x fr show that the angular momentum L = 
ur x t is a constant of the motion. Show that L = ubus = ur?o so that 
b = Wo /r?. 

(c) Let v; and v; be the initial and final velocities as shown in Fig.1.4. Since energy 
is conserved during scattering the magnitudes of these two velocities must be the 
same, i.e., |vi| = |vy| = Væ. From the symmetry of the figure it is seen that 
the x component of velocity at infinity is the same before and after the collision, 
even though it is altered during the collision. However, it is seen that the y 
component of the velocity reverses direction as a result of the collision. Let Avy 
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be the net change in the y velocity over the entire collision. Express Av, in 
terms of v,;, the y component of v;. 


(d) Using the y component of the equation of motion, obtain a relationship between 
dv, and dcos ¢. (Hint: it is useful to use conservation of angular momentum to 
eliminate dt in favor of dd.) Let ¢; and ¢ , be the initial and final values of ¢. 
By integrating du,, calculate Av, over the entire collision. How is @, related to 
Q; and to a (refer to figure)? 


(e) How is vy; related to ~; and voo? How is 0 related to a? Use the expressions 
for Av, obtained in parts (c) and (d) above to obtain the Rutherford scattering 


formula 
tan 0 2 192 
2 Atego bv2, 


What is the scattering angle for grazing (small angle collisions) and how does 
this small angle scattering relate to the initial center of mass kinetic energy and 
to the potential energy at distance 6? For grazing collisions how does 6 relate 
to the distance of closest approach? What impact parameter gives 90 degree 
scattering? 


2. One-dimensional Scattering relations: The separation of collision types according to 
Me /m; can also be understood by considering how the combination of conservation of 
momentum and of energy together constrain certain properties of collisions. Suppose 
that a particle with mass m; and incident velocity vı makes a head-on collision with a 
stationary target particle having mass m2. The conservation equations for momentum 
and energy can be written as 


1 1 
MV = Mvi + Mva 
1 1 1 
2 12 12 
-MV = -Myv + zM. 
Jya T nO ls 


where prime refers to the value after the collision. By eliminating vj between these 
two equations obtain v as a function of vı. Use this to construct an expression show- 
ing the ratio mav /mıv?, i.e., the fraction of the incident particle energy is trans- 
ferred to the target particle per collision. How does this fraction depend on m1 /mg2 
when ™/mzg is equal to unity, very large, or very small? If m/mg is very large 
or very small how many collisions are required to transfer approximately all of the 
incident particle energy to target particles? 


3. Some basic facts you should know: Memorize the value of £o (or else arrange for the 
value to be close at hand). What is the value of Boltzmann’s constant when tempera- 
tures are measured in electron volts? What is the density of the air you are breathing, 
measured in particles per cubic meter? What is the density of particles in solid copper, 
measured in particles per cubic meter? What is room temperature, expressed in elec- 
tron volts? What is the ionization potential (in eV) of a hydrogen atom? What is the 
mass of an electron and of an ion (in kilograms)? What is the strength of the Earth’s 
magnetic field at your location, expressed in Tesla? What is the strength of the mag- 
netic field produced by a straight wire carrying | ampere as measured by an observer 
located | meter from the wire and what is the direction of the magnetic field? What 
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is the relationship between Tesla and Gauss, between particles per cubic centimeter 
and particles per cubic meter? What is magnetic flux? If a circular loop of wire with 
a break in it links a magnetic flux of 29.83 Weber which increases at a constant rate to 
a flux of 30.83 Weber in one second, what voltage appears across the break? 


. Solve Eq.(1.5) the ‘easy’ way by first proving using Gauss’ law to show that the solu- 


tion of j 
V? = ——ô(r) 
E0 
is 
2 el 
~ Aner’ 
Show that this implies 
1 
es = —d(r) (1.39) 


Amr 
is a representation for the delta function. Then, use spherical polar coordinates and 
symmetry to show that the Laplacian reduces to 


1 0 Ob 
Die ko Oe {OP 
WIS r? ðr ¢ =). 
Explicitly calculate V?(1/r) and then reconcile your result with Eq.(1.39). Using 


these results guess that the solution to Eq.(1.5) has the form 


_ g(r) 
Ateor 


Substitute this guess into Eq.(1.5) to obtain a differential equation for g which is trivial 
to solve. 


. Solve Eq.(1.5) for é(r) using a more general method which illustrates several im- 


portant mathematical techniques and formalisms. Begin by defining the 3D Fourier 
transform 


o(k) = i dro(r)e (1.40) 
in which case the inverse transform is 
1 K ikr 
d(r) = nye dkd(k)e (1.41) 
and note that the Dirac delta function can be expressed as 
6(r) = ala fa ikr (1.42) 
= Om een ; 


Now multiply Eq.(1.5) by exp(—ik - r) and then integrate over all r, i.e. operate with 
J dr. The term involving V? is integrated by parts, which effectively replaces the V 
operator with ik. 


Show that the Fourier transform of the potential is 


= = qT 
o(k) = (PPAR) (1.43) 
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and use this in Eq. (1.41). 


Because of spherical symmetry use spherical polar coordinates for the k space integral. 
The only fixed direction is the r direction so choose the polar axis of the k coordinate 
system to be parallel to r. Thus k - r = kra where a = cos 0 and 0 is the polar angle. 
Also, dk = —d@k?dadk where ¢ is the azimuthal angle. What are the limits of the 
respective @, a, and k integrals? In answering this, you should first obtain an integral 


of the form 5 


ote) ~ f 


? ? 
do aa f k?dk x (?) (1.44) 
? a=? k=? 

where the limits and the integrand with appropriate coefficients are specified (i.e., 
replace all the question marks and ~ by the correct quantities). Upon evaluation of 
the ġo and a integrals Eq.(1.44) becomes an even function of k so that the range of 
integration can be extended to —oo providing the overall integral is multiplied by 1/2. 
Realizing that sin kr =Im[e'*"], derive an expression of the general form 


oo eikr 
g(r ~im f kdk—~. (1.45) 
pegs er Co 
but specify the coefficient and exact form of f (k?). Explain why the integration con- 
tour (which is along the real k axis) can be completed in the upper half complex k 
plane. Complete the contour in the upper half plane and show that the integrand has a 


single pole in the upper half plane at k =? Use the method of residues to obtain ¢(r). 


. Make sure you know how to evaluate quickly A x (B x C) and (AxB)xC. A use- 
ful mnemonic which works for both cases is: “Both variations = Middle (dot other 
two) - Outer (dot other two)”, where outer refers to the outer vector of the parentheses 
(furthest from the center of the triad), and middle refers to the middle vector in the 
triad of vectors. 


. Particle Integrator scheme (Birdsall and Langdon 1985)-In this assignment you will 
develop a simple, but powerful “leap-frog” numerical integration scheme. This is a 
type of “implicit” numerical integration scheme. This numerical scheme can later 
be used to evaluate particle orbits in time-dependent fields having complex topology. 
These calculations can be considered as numerical experiments used in conjunction 
with the analytic theory we will develop. This combined analytical/numerical ap- 
proach provides a deeper insight into charged particle dynamics than does analysis 
alone. 


Brief note on Implicit v. Explicit numerical integration schemes 


Suppose it is desired use numerical methods to integrate the equation 


Unfortunately, since y(t) is the sought-after quantity , we do not know what to use in 
the right hand side for y(t). A naive choice would be to use the previous value of y in 
the RHS to get a scheme of the form 


Ynew — Yold 
== i a o yt 
X f (Yoa, t) 


26 


Chapter 1. Basic concepts 


which may be solved to give 


Ynew = Vold + At f (Yoa, t) 
Simple and appealing as this is, it does not work since it is numerically unstable. 


However, if we use the following scheme we will get a stable result: 


Ynew — Yold 

AR = f (new + Yoid) /2, t) (1.46) 
In other words, we have used the average of the new and the old values of y in the 
RHS. This makes sense because the RHS is a function evaluated at time t whereas 
Ynew = y(t + At/2) and yoia = y(t —At/2). If Taylor expand these last quantities are 
Taylor expanded, it is seen that to lowest order y(t) = [y(t + At/2) +y(t— At/2)]/2. 
Since Ynew Occurs on both sides of the equation we will have to solve some sort of 
equation, or invert some sort of matrix to get Ynew- 
Start with 

dv 

Define, the angular cyclotron frequency vector Q =qB /m and the normalized electric 
field & = gE/m so that the above equation becomes 


— =N4+vxn (1.47) 
Using the implicit scheme of Eq.(1.46), show that Eq. (1.47) becomes 


View TAX Vie C 


where A = QAt/2 and C = voa t At (X + Voia X 2/2). By first dotting the above 
equation with A and then crossing it with A show that the new value of velocity is 
given by 

_C+AA-C-AXxC 


The new position is simply given by 


Xnew = Xold + Vnew dt 


The above two equations can be used to solve charged particle motion in complicated, 
3D, time dependent fields. Use this particle integrator to calculate the trajectory of 
an electron moving in crossed electric and magnetic fields where the non-vanishing 
components are Ey = 1 volt/meter and B, = 1 Tesla. Plot your result graphically on 
your computer monitor. Try varying the field strengths, polarities, and also try ions 
instead of electrons. 


. Use the leap-frog numerical integration scheme to demonstrate the Rutherford scat- 


tering problem: 


(1) Define a characteristic length for this problem to be the impact parameter for a 90 
degree scattering angle, 6,/2. A reasonable choice for the characteristic velocity is 
Uoo. What is the characteristic time? 
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(ii) Define a Cartesian coordinate system such that the z axis is parallel to the incident 
relative velocity vector Və and goes through the scattering center. Let the impact 
parameter be in the y direction so that the incident particle is traveling in the y — z 
plane. Make the graphics display span —50 < z/b,/2 < 50 and —50 < y/br/2 < 50. 


(iii) Set the magnetic field to be zero, and let the electric field be 
E= -Vo 


where ġ =? so Ez =? etc. 


(iv) By using r? = x? + y? + 2? calculate the electric field at each particle position, 
and so determine the particle trajectory. 


(v) Demonstrate that the scattering is indeed at 90 degrees when b = 6,/2. What 
happens when b is much larger or much smaller than 0, /2?? What happens when q1,q2 
have the same or opposite signs? 


(vi) Have your code draw the relevant theoretical scattering angle 0, and show that the 
numerical result is in agreement. 


. Collision relations- Show that ontlm yp = 1 where ø is the cross-section for a colli- 


sion, n, is the density of target particles and lm fp is the mean free path. Show also 
that the collision frequency is given by v = on,v where v is the velocity of the in- 
cident particle. Calculate the electron-electron collision frequency for the following 
plasmas: fusion (n ~ 10?°m-?, T ~ 10 keV), partially ionized discharge plasma 
(n ~ 10'6m-3, T ~ 10 eV). At what temperature does the conductivity of plasma 
equal that of copper, and of steel? Assume that Z = 1. 


10. Cyclotron motion- Suppose that a particle is immersed in a uniform magnetic field 


1 


B = B2 and there is no electric field. Suppose that at t = O the particle’s initial 
position is at x = O and its initial velocity is v = voĉ. Using the Lorentz equation, 
calculate the particle position and velocity as a function of time (be sure to take initial 
conditions into account). What is the direction of rotation for ions and for electrons 
(right handed or left handed with respect to the magnetic field)? If you had to make 
up a mnemonic for the sense of ion rotation, would it be Lions or Rions? Now, repeat 
the analysis but this time with an electric field E = ĉEo cos(wt). What happens in the 
limit where w — Q where Q = qB/m is the cyclotron frequency? Assume that the 
particle is a proton and that B = 1 Tesla, vo = 10°m/s, and compare your results with 
direct numerical solution of the Lorentz equation. Use Eo = 10* V/m for the electric 
field. 


. Space charge limited current- When a metal or metal oxide is heated to high tem- 


peratures it emits electrons from its surface. This process called thermionic emission 
is the basis of vacuum tube technology and is also essential when high currents are 
drawn from electrodes in a plasma. The electron emitting electrode is called a cath- 
ode while the electrode to which the electrons flow is called an anode. An idealized 
configuration is shown in Fig.1.5. 
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cathode] @» anode 


f 


electrons emitted 


space charge 
from cathode surface 


Figure 1.5: Electron cloud accelerated from cathode to anode encounters space charge of 
previously emitted electrons. 


This configuration can operate in two regimes: (i) the temperature limited regime 
where the current is determined by the thermionic emission capability of the cathode, 
and (ii) the space charge limited regime, where the current is determined by a buildup 
of electron density in the region between cathode and anode (inter-electrode region). 
Let us now discuss this space charge limited regime: If the current is small then the 
number of electrons required to carry the current is small and so the inter-electrode 
region is nearly vacuum in which case the electric field in this region will be nearly 
uniform and be given by Æ = V/d where V is the anode-cathode potential difference 
and d is the anode cathode separation. This electric field will accelerate the electrons 
from anode to cathode. However, if the current is large, there will be a significant 
electron density in the inter-electrode region. This space charge will create a localized 
depression in the potential (since electrons have negative charge). The result is that 
the electric field will be reduced in the region near the cathode. If the space charge is 
sufficiently large, the electric field at the cathode vanishes. In this situation attempting 
to increase the current by increasing the number of electrons ejected by the cathode 
will not succeed because an increase in current (which will give an increase in space 
charge) will produce a repulsive electric field which will prevent the additional elec- 
trons from leaving the cathode. Let us now calculate the space charge limited current 
and relate it to our discussion on Debye shielding. The current density in this system 
is 
J = —n(x)ev(x) = a negative constant 

Since potential is undefined with respect to a constant, let us choose this constant so 
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that the cathode potential is zero, in which case the anode potential is Vp. Assuming 
that electrons leave the cathode with zero velocity, show that the electron velocity as 
a function of position is given by 


2eV (x) 


Me 


v(x) = 


Show that the above two equations, plus Poisson’s equation, can be combined to give 
the following differential equation for the potential 
2 

aY oy icy 

dz? 
where \ = e ' |J| V/me/2e. By multiplying this equation with the integrating factor 
dV/dz and using the space charge limited boundary condition that Æ = Oat x = 0, 
solve for V(x). By rearranging the expression for V(x) show that the space charge 


limited current is 
jaa ee y3/2 
=e e 
9V me @ 
This is called the Child-Langmuir space charge limited current. For reference the 
temperature limited current is given by the Richardson-Dushman law, 


J = AT? eto" 


where coefficient A and the work function ġo are properties of the cathode mate- 
rial, while T is the cathode temperature. Thus, the actual cathode current will be 
whichever is the smaller of the above two expressions. Show there is a close relation- 
ship between the physics underlying the Child-Langmuir law and Debye shielding 
(hint-characterize the electron velocity as being a thermal velocity and its energy as 
being a thermal energy, show that the inter-electrode spacing corresponds to ?). Sup- 
pose that a cathode was operating in the space charge limited regime and that some 
positively charged ions were placed in the inter-electrode region. What would happen 
to the space charge-would it be possible to draw more or less current from the cath- 
ode? Suppose the entire inter-electrode region were filled with plasma with electron 
temperature T}. What would be the appropriate value of d and how much current could 
be drawn from the cathode (assuming it were sufficiently hot)? Does this give you any 
ideas on why high current switch tubes (called ignitrons) use plasma to conduct the 
current? 


2 


Derivation of fluid equations: Vlasov, 2-fluid, 
MHD 


2.1 Phase-space 


Consider a particle moving in a one-dimensional space and let its position be described 
as x = x(t) and its velocity asv = v(t). A way to visualize the x and v trajectories 
simultaneously is to plot them on a 2-dimensional graph where the horizontal coordinate is 
given by x(t) and the vertical coordinate is given by u(t). This x —v plane is called phase- 
space. The trajectory (or orbit) of several particles can be represented as a set of curves 
in phase-space as shown in Fig.2.1. Examples of a few qualitatively different phase-space 
orbits are shown in Fig.2.1. 


Vv 


particle phase-space position passing particle orbit 
at time t (positive velocity) 


periodic orbit 


quasi-periodic orbit laa 
© Me 
passing particle orbit 
negative velocity) 


Figure 2.1: Phase space showing different types of possible particle orbits. 


Particles in the upper half plane always move to the right since they have a positive 
velocity while those in the lower half plane always move to the left. Particles having exact 
periodic motion [e.g., x = Acos(wt), v = —wAsin(wt)] alternate between moving to the 
right and the left and so describe an ellipse in phase-space. Particles with nearly periodic 
(quasi-periodic) motions will have near-ellipses or spiral orbits. A particle that does not 
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reverse direction is called a passing particle, while a particle confined to a certain region of 
phase-space (e.g., a particle with periodic motion) is called a trapped particle. 


2.2 Distribution function and Vlasov equation 


At any given time, each particle has a specific position and velocity. We can therefore char- 
acterize the instantaneous configuration of a large number of particles by specifying the 
density of particles at each point x,v in phase-space. The function prescribing the instan- 
taneous density of particles in phase-space is called the distribution function and is denoted 
by f(x,v,t). Thus, f(x, v,t)dadv is the number of particles at time t having positions in 
the range between x and x + dz and velocities in the range between v and v + dv. As time 
progresses, the particle motion and acceleration causes the number of particles in these x 
and v ranges to change and so f will change. This temporal evolution of f gives a descrip- 
tion of the system more detailed than a fluid description, but less detailed than following 
the trajectory of each individual particle. Using the evolution of f to characterize the sys- 
tem does not keep track of the trajectories of individual particles, but rather characterizes 
classes of particles having the same x, v. 


Figure 2.2: A box with in phase space having width dz and height dv. 


Now consider the rate of change of the number of particles inside a small box in phase- 
space such as is shown in Fig.2.2. Defining a(x, v, t) to be the acceleration of a particle, 
it is seen that the particle flux in the horizontal direction is fv and the particle flux in the 
vertical direction is fa. Thus, the particle fluxes into the four sides of the box are: 

1. Flux into left side of box is f(x, v, t)udu 


2. Flux into right side of box is — f (x + dz, v, t)udu 
3. Flux into bottom of box is f(z, v, t)a(x,v, t)dx 


4. Flux into top of box is — f(x, v + du, t)a(x,v + du, t)dz 
The number of particles in the box is f(x,v,t)dadv so that the rate of change of parti- 
cles in the box is 


32 Chapter 2. Derivation of fluid equations: Vlasov, 2-fluid, MHD 


drdvu = —f(x+dz,v,t)vdu + f(ax,v,t)vdu 
—f(x,v + du, t)a(x,v + dv, t)dx 
+f(x,v,t)a(ax,v,t)da 
or, on Taylor expanding the quantities on the right hand side, we obtain the one dimensional 
Vlasov equation, 


(2.1) 


of of ə 
Ap ae Bn (af) =0. (2.2) 


It is straightforward to generalize Eq.(2.2) to three dimensions and so obtain the three- 
dimensional Vlasov equation 


OF ges OF 0s yea 
apn ies tag (af) =0. (2.3) 


Because x, v are independent quantities in phase-space, the spatial derivative term has the 
commutation property: 


of fô) 

s e : 2.4 

Ox. Ox (vf) ee) 
The particle acceleration is given by the Lorentz force 


a=(E+vxB) . (2.5) 
m 
Because (v x B); = v;Bpg — vk B; is independent of v;, the term O(v x B),/Ov; vanishes 


so that even though the acceleration a is velocity-dependent, it nevertheless commutes with 
the vector velocity derivative as 


of o 
pe ee : 2. 
“Oy Əv (af) C8) 
Because of this commutation property the Vlasov equation can also be written as 
of of of 
Hv- Ha = 0 2 2T 
ot Ox Ov em 


If we “sit on top of” a particle moving in phase-space with trajectory x = x(t), v = v(t) 
and measure the distribution function as we are carried along by the particle, the ob- 
served rate of change of the distribution function will be df (x(t), v(¢),t)/dt where the 
d/dt means that the derivative is measured in the moving frame. Because dx/dt = v and 
dv/dt = a, this observed rate of change is 


a ae Of |, of 


al = eet v- Ha- = 0. (2.8) 


Thus, the distribution function as measured when moving along a particle trajectory (orbit) 
is a constant. This gives a powerful method for finding solutions to the Vlasov equation. 
Since the distribution function is a constant when measured in the frame following an orbit, 
we can choose it to depend on any quantity that is constant along the orbit (Jeans 1915, 
Watson 1956). 
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For example, if the energy F of particles is constant along their orbits then f = f(E) is 
a solution to the Vlasov equation. On the other hand, if both the energy and the momen- 
tum p are constant along particle orbits, then any distribution function with the functional 
dependence f = f(E, p) is a solution to the Vlasov equation. Depending on the situation 
at hand, the energy and/or canonical momentum may or may not be constant along an or- 
bit and so whether or not f = f(E, p) is a solution to the Vlasov equation depends on the 
specific problem under consideration. However, there always exists at least one constant of 
the motion for any trajectory because, just like every human being has an invariant birth- 
day, the initial conditions of a particle trajectory are invariant along its orbit. As a simple 
example, consider a situation where there is no electromagnetic field so that a =0 in which 
case the particle trajectories are simply x(t) = x)+vot, v(t) = Vp) where xo, Vo are the 
initial position and velocity. Let us check to see whether f (xo) is indeed a solution to the 
Vlasov equation. Write xo= x(t) — vot so f(xo) = f(x(t) — vot) and observe that 
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Figure 2.3: Moments give weighted averages of the particles in the shaded vertical strip 


2.3 Moments of the distribution function 


Let us count the particles in the shaded vertical strip in Fig.2.3. The number of particles in 
this strip is the number of particles lying between x and x + dz where z is the location of 
the left hand side of the strip and x + dz is the location of the right hand side. The number 
of particles in the strip is equivalently defined as n(x,t)dx where n(x) is the density of 
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particles at x. Thus we see that f f(x,v)dv = n(x); the transition from a phase-space 
description (i.e., x,v are dependent variables) to a normal space description (i.e., £ is a 
dependent variable) involves “integrating out” the velocity dependence to obtain a quantity 
(e.g., density) depending only on position. Since the number of particles is finite, and since 
f is a positive quantity, we see that f must vanish as v — oo. 

Another way of viewing f is to consider it as the probability that a randomly selected 
particle at position x has the velocity v. Using this point of view, we see that averaging over 
the velocities of all particles at x gives the mean velocity u(x) determined by n(x)u(x) = 
J vf («,v)dv. Similarly, multiplying f by v? and integrating over velocity will give an 
expression for the mean energy of all the particles. This procedure of multiplying f by 
various powers of v and then integrating over velocity is called taking moments of the 
distribution function. 

It is straightforward to generalize this “moment-taking” to three dimensional problems 
simply by taking integrals over three-dimensional velocity space. Thus, in three dimensions 
the density becomes 


n(x) = [te v)dv (2.10) 
and the mean velocity becomes 
B J vf(x,v)dv 
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Figure 2.4: Detailed view of collisions causing ‘jumps’ in phase space 


2.3.1 Treatment of collisions in the Vlasov equation 


It was shown in Sec. 1.8 that the cumulative effect of grazing collisions dominates the 
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cumulative effect of the more infrequently occurring large angle collisions. In order to 
see how collisions affect the Vlasov equation, let us now temporarily imagine that the 
grazing collisions are replaced by an equivalent sequence of abrupt large scattering angle 
encounters as shown in Fig.2.4. Two particles involved in a collision do not significantly 
change their positions during the course of a collision, but they do substantially change their 
velocities. For example, a particle making a head-on collision with an equal mass stationary 
particle will stop after the collision, while the target particle will assume the velocity of 
the incident particle. If we draw the detailed phase-space trajectories characterized by a 
collision between two particles we see that each particle has a sudden change in its vertical 
coordinate (i.e., velocity) but no change in its horizontal coordinate (i.e., position). The 
collision-induced velocity jump occurs very fast so that if the phase-space trajectories were 
recorded with a “movie camera” having insufficient framing rate to catch the details of the 
jump the resulting movie would show particles being spontaneously created or annihilated 
within given volumes of phase-space (e.g., within the boxes shown in Fig. 2.4). 

The details of these individual jumps in phase-space are complicated and yet of little 
interest since all we really want to know is the cumulative effect of many collisions. It 
is therefore both efficient and sufficient to follow the trajectories on the slow time scale 
while accounting for the apparent “creation” or “annihilation” of particles by inserting a 
collision operator on the right hand side of the Vlasov equation. In the example shown 
here it is seen that when a particle is apparently “created” in one box, another particle must 
be simultaneously “annihilated” in another box at the same x coordinate but a different 
v coordinate (of course, what is actually happening is that a single particle is suddenly 
moving from one box to the other). This coupling of the annihilation and creation rates in 
different boxes constrains the form of the collision operator. We will not attempt to derive 
collision operators in this chapter but will simply discuss the constraints on these operators. 
From a more formal point of view, collisions are characterized by constrained sources and 
sinks for particles in phase-space and inclusion of collisions in the Vlasov equation causes 
the Vlasov equation to assume the form 


o O 
a + Og Wie) + Hy (ale) = Lo Coal fo) 2.12) 


where Coal fo) is the rate of change of fo due to collisions of species o with species a. 
Let us now list the constraints which must be satisfied by the collision operator Coa (fo) 
are as follows: 


e (a) Conservation of particles — Collisions cannot change the total number of par- 
ticles at a particular location so 


f evCrolfe) =0. (2.13) 


(b) Conservation of momentum — Collisions between particles of the same species 
cannot change the total momentum of that species so 


J vmov Coola) =0 (2.14) 
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while collisions between different species must conserve the total momentum 
of both species together so 


[aves + [avmevGul Fe) = 0. (2.15) 


(c) Conservation of energy —Collisions between particles of the same species can- 
not change the total energy of that species so 


J dvmov?’ Crol fo) =0 (2.16) 


while collisions between different species must conserve the total energy of 
both species together so 


J dvm;v? Ciel fi) + J dvmev? Ceil fe) = 0. (2.17) 


2.4 Two-fluid equations 


Instead of just taking moments of the distribution function f itself, moments will now 
be taken of the entire Vlasov equation to obtain a set of partial differential equations re- 
lating the mean quantities n(x), u(x), etc. We begin by integrating the Vlasov equation, 
Eq.(2.12), over velocity for each species. This first and simplest step in the procedure is 
often called taking the “zeroth” moment, since we are multiplying by unity which for con- 
sistency with later “moment-taking”, can be considered as multiplying the entire Vlasov 
equation by v raised to the power zero. Multiplying the Vlasov equation by unity and then 
integrating over velocity gives 


f EE + 2 (vfo) + 2 ; (afo) dv = D | Coeli). (2.18) 


The velocity integral commutes with both the time and space derivatives on the left hand 
side because x, v, and t are independent variables, while the third term on the left hand side 
is the volume integral of a divergence in velocity space. Gauss’s theorem [i.e., J, o OX > 
Q= Jz fe ds - Q] gives fə evaluated on a surface at v = co. However, because fọ — 0 
as v — ov, this surface integral in velocity space vanishes. Using Eqs.(2.10), (2.11), and 
(2.13), we see that Eq.(2.18) becomes the species continuity equation 


Ong 
ot 
Now let us multiply Eq.(2.12) by v and integrate over velocity to take the “first moment”, 


+V- (nous) =0. (2.19) 


fy E au 2 - (vfo) + 2 ; (afo) dv = D | Vrav: (2.20) 


This may be re-arranged in a more tractable form by: 
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(i) “pulling” both the time and space derivatives out of the velocity integral, 


(ii) writing v = v'(x, t) + u(x,t) where v’(x,¢) is the random part of a given velocity, 
i.e., that part of the velocity which differs from the mean (note that v is independent 
of both x and ¢ but v’ is not; also dv =dv’), 


(iil) integrating by parts in 3-D velocity space on the acceleration term and using 


x) 
—— = Diz: 
G ij 


After performing these manipulations, the first moment of the Vlasov equation be- 


comes ə ə 
2 (ete) +2. J Gal Ase aie aay 


(2.21) 
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where Rosa is the net frictional drag force due to collisions of species o with species a. 
Note that Roc = O since a species cannot exert net drag on itself (e.g., the totality of 
electrons cannot cause frictional drag on the totality of electrons). The frictional terms 
have the form 

Rei = VeiMeNe (Ue = u;i) (2.22) 


Rie = Viemini(u; — Ue) (2.23) 
so that in the ion frame the drag on electrons is simply the total electron momentum 
MeNeUe Measured in this frame multiplied by the rate ve; at which this momentum is 
destroyed by collisions with ions. This form for frictional drag has the following proper- 
ties: (i) Rei + Rie = 0 showing that the plasma cannot have a frictional drag on itself, (ii) 
friction causes the faster species to be slowed down by the slower species, and (iii) there is 
no friction between species if both have the same mean velocity. 

Equation (2.21) can be further simplified by factoring u out of the velocity integrals 
and recalling that by definition f v'fẹdv’ =0 . Thus, Eq. (2.21) reduces to 


8 (NoUs) ð = ð > 
Mo LOE + ax $ (nett) =No Vo (E + us XB) “px - Po—Roa (2.24) 


where the pressure tensor P is defined by 
— 
Po=™Me [vv foaw’ 


L PEDE 
If f, is an isotropic function of v’, then the off-diagonal terms in P , vanish and the three 
diagonal terms are identical. In this case, it is useful to define the diagonal terms to be the 
scalar pressure P,,1.¢., 


Ps =M¢ fvw, fody’ = mo J vyv, foadv' = Mo f viv) fodv' 


S (2.25) 
= a v'- v' fdv’. 
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Equation (2.25) defines pressure for a three-dimensional isotropic system. However, we 
will often deal with systems of reduced dimensionality, i.e., systems with just one or two 
dimensions. Equation (2.25) can therefore be generalized to these other cases by introduc- 
ing the general N-dimensional definition for scalar pressure 


Mo , 


N 
Py = e fod v! =e f ep haa (2.26) 
j=l 


where v’ is the N -dimensional random velocity. 

It is important to emphasize that assuming isotropy is done largely for mathematical 
convenience and that in real systems the distribution function is often quite anisotropic. 
Collisions, being randomizing, drive the distribution function towards isotropy, while com- 
peting processes simultaneously drive it towards anisotropy. Thus, each situation must be 
considered individually in order to determine whether there is sufficient collisionality to 
make f isotropic. Because fully-ionized hot plasmas often have insufficient collisions to 
make f isotropic, the oft-used assumption of isotropy is an oversimplification which may 
or may not be acceptable depending on the phenomenon under consideration. 

On expanding the derivatives on the left hand side of Eq.(2.24), it is seen that two of 
the terms combine to give u times Eq. (2.19). After removing this embedded continuity 
equation, Eq.(2.24) reduces to 


du, 
nomo “2 =noqo (E + us xB) —VP, — Roa (2.27) 


where the operator d/dtis defined to be the convective derivative 


d ð 
— = — o: V 22) 
rr ai (2.28) 


which characterizes the temporal rate of change seen by an observer moving with the mean 

fluid velocity u, of species o.An everyday example of the convective term would be the 
apparent temporal increase in density of automobiles seen by a motorcyclist who enters a 
traffic jam of stationary vehicles and is not impeded by the traffic jam. 

At this point in the procedure it becomes evident that a certain pattern recurs for each 
successive moment of the Vlasov equation. When we took the zeroth moment, an equation 
for the density f f,dv resulted, but this also introduced a term involving the next higher 
moment, namely the mean velocity ~ f v fdv. Then, when we took the first moment to 
get an equation for the velocity, an equation was obtained containing a term involving the 
next higher moment, namely the pressure ~ f vv f,dv. Thus, each time we take a moment 
of the Vlasov equation, an equation for the moment we want is obtained, but because of the 
v- Vf term in the Vlasov equation, a next higher moment also appears. Thus, moment- 
taking never leads to a closed system of equations; there is always be a “loose end”, a 
highest moment for which there is no determining equation. Some sort of ad hoc closure 
procedure must always be invoked to terminate this chain (as seen below, typical closures 
involve invoking adiabatic or isothermal assumptions). Another feature of taking moments 
is that each higher moment has embedded in it a term which contains complete lower 
moment equations multiplied by some factor. Algebraic manipulation can identify these 
lower moment equations and eliminate them to give a simplified higher moment equation. 
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Let us now take the second moment of the Vlasov equation. Unlike the zeroth and 
first moments, here the dimensionality of the system enters explicitly so the more general 
pressure definition given by Eq. (2.26) will be used. Multiplying the Vlasov equation by 
Mov? /2 and integrating over velocity gives 


O Mov? 
Sg, g; -aN 
aL J fod v 


2 
O Mev? Fe = y2 7 
taf gee = [mo Coafed v. (2.29) 
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We now consider each term of this equation separately as follows: 
1. The time derivative term becomes 


O fma? 0 f mo (v'+u,)? 0 (NP, Monsu? 
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2. Again using v = v’ + u, the space derivative term becomes 


=f 
Ox 


m v’? 
where Q, = is Se ved ¥ is called the heat flux. 


2 
Mog vfsd™v =V. (2. Pou, 4 


3. On integrating by parts, the acceleration term becomes 
v? ð N 
de | 5 y` (E +v x B) f.|d°v=—-q | v-Ef,dv = —qoncUs - E. 
Vv 


4. The collision term becomes (using Eq.(2.16)) 


v? v? OW 
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Dj” 2 f à J,” 2 f ž (S Ta 
where (8W/ô3t) 


species a. 
Combining the above four relations, Eq.(2.29) becomes 


ð (NP, onou? 2+N onou 
( p Teete) Lv (0, = Prus + erate, ) doNoUg - E 


Eca ÍS the rate at which species ø collisionally transfers energy to 


Ot 2 2 2 


“(aw 
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(2.30) 


This equation can be simplified by invoking two mathematical identities, the first of which 
is 
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(2.31) 
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The second identity is obtained by dotting the equation of motion with u, and is 


ð (u? už 
NoMo E (=) +u,- (v (=) —u, x Vx u- )| 
= MNodcUc - BE — us Vi- Ro Us 


or 


d aus 
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Inserting Eqs. (2.31) and (2.32) in Eq.(2.30) gives the energy evolution equation 
NdaP, 24N Ow 
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The first term on the right hand side represents the heat flux, the second term gives the 
frictional heating of species ø due to frictional drag on species a, while the last term on 
the right hand side gives the rate at which species ø collisionally transfers energy to other 
species. Although Eq.(2.33) is complicated, two important limiting situations become ev- 
ident if we let t be the characteristic time scale for a given phenomenon and / be its char- 
acteristic length scale. A characteristic velocity V,, ~ l/t may then be defined for the 
phenomenon and so, replacing temporal derivatives by t~! and spatial derivatives by I~! 
in Eq.(2.33), it is seen that the two limiting situations are: 

1. Isothermal limit — The heat flux term dominates all other terms in which case the 
temperature becomes spatially uniform. This occurs if (i) vrs >> Vpn since the ratio 
of the left hand side terms to the heat flux term is ~ Vpn /UTo and (ii) the collisional 
terms are small enough to be ignored. 


2. Adiabatic limit — The heat flux terms and the collisional terms are small enough to 
be ignored compared to the left hand side terms; this occurs when Vph >> Vro. 
Adiabatic is a Greek word meaning ‘impassable’, and is used here to denote that no 
heat is flowing. 

Both of these limits make it possible to avoid solving for Q, which involves the third 
moment and so both the adiabatic and isothermal limit provide a closure to the moment 
equations. 

The energy equation may be greatly simplified in the adiabatic limit by re-arranging the 
continuity equation to give 


1 dn, 
V-u,=-— 2.34 
k Ng dt ( ) 
and then substituting this expression into the left hand side of Eq.(2.33) to obtain 
1 dP, y dno 
= 2: 
P, dt No dt 22) 
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ao (2.36) 
Equation (2.35) may be integrated to give the adiabatic equation of state 
Ps 
— = constant; (2.37) 
No 


this can be considered a derivation of adiabaticity based on geometry and statistical me- 
chanics rather than on thermodynamic arguments. 
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2.4.1 Entropy of a distribution function 


Collisions cause the distribution function to tend towards a simple final state characterized 
by having the maximum entropy for the given constraints (e.g., fixed total energy). To 
see this, we provide a brief discussion of entropy and show how it relates to a distribution 
function. 

Suppose we throw two dice, labeled A and B, and let R denote the result of a throw. 
Thus œF ranges from 2 through 12. The complete set of (A, B) combinations that give these 
R’s are listed below: 


R=2< (1,1) 

R =34—>(1,2),(2,1) 

R =4< > (1,3),(3,1),(2,2) 
R=5>(1,4),(4,D,(2,3),(,2) 

R =6 == (1,5),(5,1),(2,4),(4,2),(3,3) 
R =7<==>(1,6),(6,1),(2,5),(5,2),(3,4),(4,3) 
R =84—>(2,6),(6,2),(3,5),(5,3),(4,4) 
R =9<— >(3,6),(6,3),(4,5),(5,4) 

R =10=>(4,6),(6,4),(5,5) 
R=11=>(5,6),(6,5) 

R =124—> (6,6) 


There are six (A, B) pairs that give R =7, but only one pair for R =2 and only one pair 
for R =12. Thus, there are six microscopic states [distinct (A, B) pairs] corresponding to 
R =7 but only one microscopic state corresponding to each of R =2 or R =12. Thus, 
we know more about the microscopic state of the system if R = 2 or 12 than if R = 7. 
We define the entropy S to be the natural logarithm of the number of microscopic states 
corresponding to a given macroscopic state. Thus for the dice, the entropy would be the 
natural logarithm of the number of (A, B) pairs that correspond to a give R. The entropy 
for R = 2 or R = 12 would be zero since S = In(1) = 0, while the entropy for R = 7 
would be S = In(6) since there were six different ways of obtaining R = 7. 

If the dice were to be thrown a statistically large number of times the most likely result 
for any throw is R = 7; this is the macroscopic state with the most number of microscopic 
states. Since any of the possible microscopic states is an equally likely outcome, the most 
likely macroscopic state after a large number of dice throws is the macroscopic state with 
the highest entropy. 

Now consider a situation more closely related to the concept of a distribution function. 
In order to do this we first pose the following simple problem: Suppose we have a pegboard 
with M holes, labeled hy, ho, ...h,; and we also have M pegs labeled by p,, pa, ..., PN- 
What are the number of ways of putting all the pegs in all the holes? Starting with hole 
hı, we have a choice of M different pegs, but when we get to hole hz there are now only 
N — 1 pegs remaining so that there are now only M — 1 choices. Using this argument for 
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subsequent holes, we see there are M! ways of putting all the pegs in all the holes. 

Let us complicate things further. Suppose that we arrange the holes in M groups, say 
group G has the first 10 holes, group Gz has the next 19 holes, group G3 has the next 4 
holes and so on, up to group M. We will use f to denote the number of holes in a group, 
thus f(1) = 10, f(2) = 19, f(3) = 4, etc. The number of ways of arranging pegs within 
a group is just the factorial of the number of pegs in the group, e.g., the number of ways 
of arranging the pegs within group 1 is just 10! and so in general the number of ways of 
arranging the pegs in the 7“” group is [f(7)]!. 

Let us denote C'as the number of ways for putting all the pegs in all the groups without 
caring about the internal arrangement within groups. The number of ways of putting the 
pegs in all the groups caring about the internal arrangements in all the groups is C x f(1)! x 
f (2)! x ...f(M)!, but this is just the number of ways of putting all the pegs in all the holes, 
bes 


C x f(1)! x f(2)! x ...f (M)! =N! 


N! 
FD! x F2)! x (ME 
Now C is just the number of microscopic states corresponding to the macroscopic state 
of the prescribed grouping f (1) = 10, f(2) = 19, f(3) = 4, etc. so the entropy is just 
S=InC or 


or 


N! 
S = In ( i ) 
f)! x f(2)! x ...f(M)! (2.38) 
= InN!—Inf(1)!—mf(2)!—...—Inf(M)! 

Stirling’s formula shows that the large argument asymptotic limit of the factorial function 
is 

lim lnk!= klnk- k. (2.39) 

k— large 


Noting that f (1) + f (2) +...f(M) = N the entropy becomes 


S = NinN — fO) m f) — f2) n F2) —... — f(M) n f(M) 


M 
NinN — X FG) fC) (2.40) 


j=l 
The constant M ln M is often dropped, giving 


M 


S=-)S > f) m f). (2.41) 


j=l 


If j is made into a continuous variable say, 7 — v so that f(v)dv is the number of items in 
the group labeled by v, then the entropy can be written as 


S= - f wte) ln f(v). (2.42) 
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By now, it is obvious that f could be the velocity distribution function in which case f(v)du 
is just the number of particles in the group having velocity between v and v + dv. Since the 
peg groups correspond to different velocity ranges coordinates, having more dimensions 
just means having more groups and so for three dimensions the entropy generalizes to 


S= - fav f(v) In f(v). (2.43) 


If the distribution function depends on position as well, this corresponds to still more peg 
groups, and so a distribution function which depends on both velocity and position will 
have the entropy 


S= - fax fav few) In f(x,v). (2.44) 
2.4.2 Effect of collisions on entropy 


The highest entropy state is the most likely state of the system because the highest entropy 
state has the most number of microscopic states corresponding to the macroscopic state. 
Collisions (or other forms of randomization) will take some prescribed initial microscopic 
state where phase-space positions of all particles are individually specified and scramble 
these positions to give a new microscopic state. The new scrambled state could be any 
microscopic state, but is most likely to be a member of the class of microscopic states 
belonging to the highest entropy macroscopic state. Thus, any randomization process such 
as collisions will cause the system to evolve towards the maximum entropy macroscopic 
state. 

An important shortcoming of this argument is that it neglects any conservation rela- 
tions that have to be satisfied. To see this, note that the expression for entropy could be 
maximized if all the particles are put in one group, in which case C = N!, which is the 
largest possible value for C. Thus, the maximum entropy configuration of M plasma parti- 
cles corresponds to all the particles having the same velocity. However, this would assign 
a specific energy to the system which would in general differ from the energy of the initial 
microstate. This maximum entropy state is therefore not accessible in isolated system, be- 
cause energy would not be conserved if the system changed from its initial microstate to 
the maximum entropy state. 

Thus, a qualification must be added to the argument. Randomizing processes will 
scramble the system to attain the state of maximum entropy state consistent with any con- 
straints placed on the system. Examples of such constraints would be the requirements that 
the total system energy and the total number of particles must be conserved. We therefore 
re-formulate the problem as: given an isolated system with M particles in a fixed volume V 
and initial average energy per particle (Æ) , what is the maximum entropy state consistent 
with conservation of energy and conservation of number of particles? This is a variational 
problem because the goal is to maximize S' subject to the constraint that both V and N (EF) 
are fixed. The method of Lagrange multipliers can then be used to take into account these 
constraints. Using this method the variational problem becomes 


65 — AN — Az 6(N (E)) =0 (2.45) 


where A; and Az are as-yet undetermined Lagrange multipliers. The number of particles 
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N=V f fav. (2.46) 


The energy of an individual particle is Æ = mv?/2 where v is the velocity measured in 
the rest frame of the center of mass of the entire collection of M particles. Thus, the total 
kinetic energy of all the particles in this rest frame is 


2. 
N(E)=V a f(v)dv (2.47) 


and so the variational problem becomes 


2 
5 f aw (rms Guy wv r) =0. (2.48) 


Incorporating the volume V into the Lagrange multipliers, and factoring out the coefficient 
ôf this becomes 


2 
[evst(a tInf—\y oS ) =o. (2.49) 
Since ôf is arbitrary, the integrand must vanish, giving 
mv? 


where the ‘1’ has been incorporated into A1. 

The maximum entropy distribution function of an isolated, energy and particle conserv- 
ing system is therefore 

f = Arexp(—Az2mv?/2); (2.51) 

this is called a Maxwellian distribution function. We will often assume that the plasma is 
locally Maxwellian so that Ay = A1(x,t) and A2 = A(x, t). We define the temperature to 
be 

1 
A2 (x, t) 
where Boltzmann’s factor «x allows temperature to be measured in various units. The nor- 
malization factor is set to be 


KT; (x,t) = (2.52) 


mi N/2 
à (x,t) = t) | — 2.53 
1051) = nd (= 2.53) 
where N is the dimensionality (1, 2, or 3) so that f fo(x, v,t)dNv =n; (x,t). Because the 
kinetic energy of individual particles was defined in terms of velocities measured in the rest 
frame of the center of mass of the complete system of particles, if this center of mass is 
moving in the lab frame with a velocity u,, then in the lab frame the Maxwellian will have 
the form 


Mo 


N/2 
Mo = pe JJag 
me) exp(—m,(v — u,)°/2K7;,). (2.54) 


fo(x, vt) = no ( 
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2.4.3 Relation between pressure and Maxwellian 


The scalar pressure has a simple relation to the generalized Maxwellian as seen by recasting 


Eq.(2.26) as 
Pa -S NoMa a d Jera 
N T dB 
NoMe [B Med B ae (2.55) 
oa) ioe) 
= nokTz, 


which is just the ideal gas law. Thus, the definitions that have been proposed for pressure 
and temperature are consistent with everyday notions for these quantities. 

Clearly, neither the adiabatic nor the isothermal assumption will be appropriate when 
Vpn ~ Vro. The fluid description breaks down in this situation and the Vlasov description 
must be used. It must also be emphasized that the distribution function is Maxwellian 
only if there are sufficient collisions or some other randomizing process. Because of the 
weak collisionality of a plasma, this is often not the case. In particular, since the collision 
frequency scales as v~?, fast particles take much longer to become Maxwellian than slow 
particles. It is not at all unusual for a plasma to be in a state where the low velocity particles 
have reached a Maxwellian distribution whereas the fast particles form a non-Maxwellian 
“tail”. 

We now summarize the two-fluid equations: 


e continuity equation for each species 


Ons 
Ot 


+V- (nous) =0 (2.56) 


e equation of motion for each species 


dus 
nomo =E Nao (E + us xB) -VP, — Roa (2.57) 


e equation of state for each species 


Regime Equation of state Name 
Von >> Ute | Po ~ nd adiabatic 
Von << Ute | Ps = NnoKTo , To=constant | isothermal 


e Maxwell’s equations 


OB 
Vx E=—-— 2.58 
x AE (2.58) 
OE 
V x B =H J nodes ve 2.59 
x Ho vied + Hoo Be (2.59) 
V-B=0 (2.60) 


1 
V-E= goat (2.61) 
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2.5 Magnetohydrodynamic equations 


Particle motion in the two-fluid system was described by the individual species mean veloc- 
ed 


ities ue, u; and by the pressures Po, P ; which gave information on the random deviation 
of the velocity from its mean value. Magnetohydrodynamics is an alternate description of 
the plasma where instead of using ue, u; to describe mean motion, two new velocity vari- 
ables that are a linear combination of ue, u; are used. As will be seen below, this means a 
slightly different definition for pressure must also be used. 

The new velocity-like variables are (i) the current density 


J= N Ne Jo Ug (2.62) 


which is essentially the relative velocity between ions and electrons, and (ii) the center of 
mass velocity 


= 15 monouso: (2.63) 
P s 


where 


p= 5 MoNo (2.64) 


is the total mass density. Magnetohydrodynamics is primarily concerned with low fre- 
quency, long wavelength, magnetic behavior of the plasma. 


2.5.1 MHD continuity equation 


Multiplying Eq.(2.19) by m, and summing over species gives the MHD continuity equa- 
tion 


Op E 
aT V - (pU) =0. (2.65) 


2.5.2 MHD equation of motion 


To obtain an equation of motion, we take the first moment of the Vlasov equation, then 
multiply by m, and sum over species to obtain 


HLM fvat D | mew fate Tae f vgy E+ xB) fo] =0 


(2.66) 
the right hand side is zero since Re; + Rie = 0, i.e., the total plasma cannot exert drag on 
itself. We now define random velocities relative to U (rather than to u, as was the case for 
the two-fluid equations) so that the second term can be written as 


S | mow foatv =5 mtv + U)(v' + U) fdv => | mov'v' fodv+pUU 


(2.67) 
where $`, f MoV'fodv = 0 has been used to eliminate terms linear in v’. The MHD 
pressure tensor is now defined in terms of the random velocities relative to U and is given 
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by 
pees | mgv'v! fdv. (2.68) 


We insert Eqs.(2.67) and (2.68) in Eq.(2.66), integrate by parts on the acceleration term, 
and perform the summation over species to obtain the MHD equation of motion 


O(pU) MHD 
5 +V.: (PUU) = (Era) BraxB-vP (2.69) 


oO 


MHD is typically used to describe phenomena with large spatial scales where the plasma 
is essentially neutral, so that $`, noqo  O. Just as in the two-fluid situation, the left hand 
side of Eq.(2.69) contains a factor times the MHD continuity equation, 

O(pU) 


Op OU 
“a +V-(pUU) = E +V-(90)] Ut Par +pUV-U. (2.70) 


Using Eq.(2.65) leads to the standard form for the MHD equation of motion 


DU 
=~ =Jx B-V-P™MHD (2.71) 
PDE 


where 5 9 
—= —+UV (2.72) 


is the convective derivative using the MHD center of mass velocity. Scalar approximations 


of the MHD pressure tensor will be postponed until after discussing implications of the 
MHD Ohm’s law. 


2.5.3 MHD Ohm’s law 


Equation (2.71) provides one equation relating J and U; let us now find the other one. In 
order to find this second relation between J and U consider the two-fluid electron equation 
of motion: 


du, 
eat 

In MHD we are interested in low frequency phenomena with large spatial scales. If the 
characteristic time scale of the phenomenon is long compared to the electron cyclotron 
motion, then the electron inertia term m_du./dt can be dropped since it is small compared 
to the magnetic force term —e(u. xB). This assumption is reasonable for velocities per- 
pendicular to B, but can be a poor approximation for the velocity component parallel to B, 
since parallel velocities do not provide a magnetic force. Since ue — u; = —J/n-e and 
u; œ U, Eq.(2.73) reduces to the generalized Ohm’s law 


1 
= —e(E+u.xB) — one (neKT.) — VeiMe(Ue — Uj). (2.73) 


1 1 
E +U x B — —J x B + — V (nekT.) = n3. (2.74) 


e e 


The term —J x B/nee on the left hand side of Eq.(2.74) is called the Hall term and can be 
neglected in either of the following two cases: 
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1. The pressure term in the MHD equation of motion, Eq.(2.71) is negligible compared 
to the other two terms which therefore must balance giving 


[J] ~wp|U]/|BI; 


here w ~ D/Dt is the characteristic frequency of the phenomenon. In this case com- 
parison of the Hall term with the U x B term shows that the Hall term is small by 
a factor ~ w/w; where wei = qiB/m; is the ion cyclotron frequency. Thus drop- 
ping the Hall term is justified for phenomena having characteristic frequencies small 
compared to wci. 


2. The electron-ion collision frequency is large compared to the electron cyclotron fre- 
quency Wee = qeB/Me in which case the Hall term may be dropped since it is small 
by a factor wee /Vei compared to the right hand side resistive term nJ =(MeVei /Nnee?)I. 

From now on, when using MHD it will be assumed that one of these conditions is true 
and Hall terms will be dropped (if Hall terms are retained, the system is called Hall MHD). 
Typically, Eq. (2.74) will not be used directly; instead its curl will be used to provide the 
induction equation 


OB 1 


—-— +V x(UxB)- 
a AUB) Nee 


Vne x VKT. = V x (Av x B) (2.75) 
Lo 


Usually the density gradient is parallel to the temperature gradient so that the thermal elec- 
tromotive force term (nee) * Vne X VKT can be dropped, in which case the induction 
equation reduces to 


OB 
-F + Vx (UxB)=0x (Av xB). (2.76) 
ðt Ho 
The thermal term is often simply ignored in the MHD Ohm’s law, which is written as 


E+UxB=nJ; (2.77) 
this is only acceptable providing we intend to take the curl and providing Vne x VAT. ~ 0. 


2.5.4 Ideal MHD and frozen-in flux 


If the resistive term 7J is small compared to the other terms in Eq.(2.77), then the plasma 
is said to be ideal or perfectly conducting. From the Lorentz transformation of electromag- 
netic theory we realize that E + U x B = E’ where E’ is the electric field observed in 
the frame moving with velocity U. This implies that the magnetic flux in ideal plasmas 
is time-invariant in the frame moving with velocity U, because otherwise Faraday’s law 
would imply the existence of an electric field in the moving frame. In order to have the 
magnetic flux invariant in the moving frame, the magnetic field lines must convect with 
the velocity U, i.e., the magnetic field lines are frozen into the plasma and move with the 
plasma. The frozen-in field concept is the essential “bed-rock” concept underlying ideal 
MHD. While this concept is often an excellent approximation, it must be kept in mind 
that the concept becomes invalid in situations when any one of the electron inertia, electron 
pressure, or Hall terms become important and lead to different, more complex behavior. 
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A formal proof of this frozen-in flux property will now be established by direct calcula- 
tion of the rate of change of the magnetic flux through a surface S(t) bounded by a material 
line C(t), i.e., a closed contour which moves with the plasma. This magnetic flux is 


a(t) = | B(x, t)-ds (2.78) 
S(t) 


and the flux changes with respect to time due to either (i) the explicit time dependence of 
B(t) or (ii) changes in the surface S(t) resulting from plasma motion. The rate of change 
of flux is thus 


¥ B(x, t + 6t)-ds— B(x, t)-ds 
DE — lim (a ( ) Jsw (x,t) ) (2.79) 


Dt ~ ôt—0 ôt 


The displacement of a segment dl of the bounding contour Cis Uôt where U is the velocity 
of this segment. The incremental change in surface area due to this displacement is AS = 
U6t x dl. The rate of change of flux can thus be expressed as 


OB 
J (Bex. + ate) as- f B(x, t)-ds 
DP , S(t46t) at S(t) 
— = lm 
Dt ôt—0 ôt 
OB 
Í B(x, t) + ôt-—— dst f B(x, t) - Udt x a- f B(x, t)-ds 
. S(t) ot c S(t) 
lim 
ôt—0 ôt 
OB 
= Sree f B(x,t)-U x dl 
s(t) Ot c 
OB 
= | ety x (B x 0) ds. (2.80) 
S(t) 
Thus, if 
OB 
TV x (U xB) (2.81) 
then ne 
— =0 2.82 
Di (2.82) 


so that the magnetic flux linked by any closed material line is constant. Therefore, magnetic 
flux is frozen into an ideal plasma because Eq.(2.76) reduces to Eq.(2.81) if77 = 0. Equation 
(2.81) is called the ideal MHD induction equation. 


2.5.5 MHD equations of state 


Double adiabatic laws A procedure analogous to that which led to Eq.(2.35) gives the 


MHD adiabatic relation 
PMHD 


= const. (2.83) 


where again y = (N + 2)/N and N is the number of dimensions of the system. It 
was shown in the previous section that magnetic flux is conserved in the plasma frame. 
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This means that, as shown in Fig.2.5, a tube of plasma initially occupying the same vol- 
ume as a magnetic flux tube is constrained to evolve in such a way that f B-ds stays con- 
stant over the plasma tube cross-section. For a flux tube of infinitesimal cross-section, the 
magnetic field is approximately uniform over the cross-section and we may write this as 
BA = const. where A is the cross-sectional area. Let us define two temperatures for this 
magnetized plasma, namely T, the temperature corresponding to motions perpendicular to 
the magnetic field, and T the temperature corresponding to motions parallel to the mag- 
netic field. If for some reason (e.g., anisotropic heating or compression) the temperature 
develops an anisotropy such that T, 4 7) and if collisions are infrequent, this anisotropy 
will persist for a long time, since collisions are the means by which the two temperatures 
equilibrate. Thus, rather than assuming that the MHD pressure is fully isotropic, we con- 
sider the less restrictive situation where the MHD pressure tensor is given by 


as P 0 0 = we 
PMED_/| 9 P, 0 | =P, I +(P)— P.)BB. (2.84) 
0o 0 R] 


The first two coordinates (x, y-like) in the above matrix refer to the directions perpendic- 

ular to the local magnetic field B and the third coordinate (z-like) refers to the direction 
— 

parallel to B. The tensor expression on the right hand side is equivalent (here I is the unit 

tensor) but allows for arbitrary, curvilinear geometry. We now develop separate adiabatic 

relations for the perpendicular and parallel directions: 


e Parallel direction- Here the number of dimensions is N = 1 so that y = 3 and so 
the adiabatic law gives 


ll = const. (2.85) 


where p;p is the one-dimensional mass density; i.e., jp ~ 1/Lwhere L is the 
length along the one-dimension, e.g. along the length of the flux tube in Fig.2.5. The 
three-dimensional mass density p, which has been used implicitly until now has the 
proportionality p ~ 1/LA where A is the cross-section of the flux tube; similarly 
the three dimensional pressure has the proportionality P) ~ pT). However, we must 
be careful to realize that PI” ~ pıpT so, using BA = const., Eq. (2.85) can be 


recast as 
Pir oT 1 
const. = = ~ Piwi ~T ~ (z) Tı(LA)?B?. (2.86) 
PID Pip LA —“—— 
—_e—“-" p73 
Pi 
or 
PIB? 
—— = const. (2.87) 


e Perpendicular direction- Here the number of dimensions is N = 2 so that y = 2 
and the adiabatic law gives 


Pp2P 
—— = const. (2.88) 
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where P?P is the 2-D perpendicular pressure, and has dimensions of energy per 
unit area, while pəp is the 2-D mass density and has dimensions of mass per unit 
area, Thus, pọp ~ 1/A so that P?? ~ pọpTı ~ T/A in which case Eq.(2.88) 
can be re-written as 


P?P 1 LA 
E (za) “B R 
or 
P 
aE = const. (2.90) 


Equations (2.87) and (2.90) are called the double adiabatic or CGL laws after Chew, 
Goldberger and Low (1956) who first developed them using a Vlasov analysis). 


Figure 2.5: Magnetic flux tube with flux 6 = BA. 


Single adiabatic law If collisions are sufficiently frequent to equilibrate the perpen- 
dicular and parallel temperatures, then the pressure tensor becomes fully isotropic and the 
dimensionality of the system is N = 3 so that y = 5/3. There is now just one pressure and 
temperature and the adiabatic relation becomes 


= 7 const. (2.91) 
pe 


2.5.6 MHD approximations for Maxwell’s equations 


The various assumptions contained in MHD lead to a simplifying approximation of Maxwell’s 
equations. In particular, the assumption of charge neutrality in MHD makes Poisson’s 
equation superfluous because Poisson’s equation prescribes the relationship between non- 
neutrality and the electrostatic component of the electric field. The assumption of charge 
neutrality has implications for the current density also. To see this, the 2-fluid continuity 
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equations is multiplied by q, and then summed over species to obtain the charge conserva- 
tion equation 


O 
= £ nods) iV. SO: (2.92) 
Thus, charge neutrality implies 
V-J=0. (2.93) 
Let us now consider Ampere’s law 


OE 
V x B =uoJ+Ho£E0 


~ (2.94) 


Taking the divergence gives 


V - J+£0 2 =0 
which is equivalent to Eq.(2.92) if Poisson’s equation is invoked. 

Finally, MHD is restricted to phenomena having characteristic velocities Vp, slow com- 
pared to the speed of light in vacuum c = (€ofg)~'/?. Again t is assumed to represent the 
characteristic time scale for a given phenomenon and / is assumed to represent the cor- 
responding characteristic length scale so that V,, ~ l/t. Faraday’s equation gives the 
scaling 


OB 
Vx E=— -y => E~ Bift. (2.95) 


On comparing the magnitude of the displacement current term in Eq.(2.94) to the left hand 
side it is seen that 


on 2E 
POO en) EBE A VaN (2.96) 
lV x BI Bil ope 


Thus, if Vpn << c the displacement current term can be dropped from Ampere’s law 
resulting in the so-called “pre-Maxwell” form 


V x B =J. (2.97) 


The divergence of Eq. (2.97) gives Eq.(2.93) so it is unnecessary to specify Eq.(2.93) 
separately. 


2.6 Summary of MHD equations 


We may now summarize the MHD equations: 
1. Mass conservation 


Op o 
aF V - (pU) =0. (2.98) 


2. Equation of state and associated equation of motion 


(a) Single adiabatic regime, collisions equilibrate perpendicular and parallel tem- 
peratures so that both pressure and temperature are isotropic 


PJE = const. (2.99) 
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and the equation of motion is 


~~ =J x B-VP. (2.100) 


(b) Double adiabatic regime, the collision frequency is insufficient to equilibrate 
perpendicular and parallel temperatures so that 


P| B? 
p? 


P 
= const., — = const. (2.101) 
pB 


and the equation of motion is 


DU ae 
Pp, =IXB-V. P, T +(P)-P.)BB). (2.102) 


3. Faraday’s Law 


OB 
VxE=-—. 2.1 
x AE (2.103) 
4. Ampere’s Law 
V x B =J. (2.104) 
5. Ohm’s Law 
E+UxBe=nJ. (2.105) 


These equations provide a self-consistent description of phenomena that satisfy all the 
various assumptions we have made, namely: 
(i) The plasma is charge-neutral since characteristic lengths are much longer than a 
Debye length; 
(ii) The characteristic velocity of the phenomenon under consideration is slow com- 
pared to the speed of light; 
(iii) The pressure and density gradients are parallel, so there is no electrothermal EMF; 
(iv) The time scale is long compared to both the electron and ion cyclotron periods. 
Even thought these assumptions are self-consistent, they may not accurately portray a 
real plasma and so MHD models, while intuitively appealing, must be used with caution. 


2.7 Sheath physics and Langmuir probe theory 


Let us now turn attention back to Vlasov theory and discuss an immediate practical ap- 
plication of this theory. The properties of collisionless Vlasov equilibria can be combined 
with Poisson’s equation to develop a model for the potential in the steady-state transition 
region between a plasma and a conducting wall; this region is known as a sheath and is 
important in many situations. The sheath is non-neutral and its width is of the order of a 
Debye length. The exact sheath potential profile must be solved numerically because of 
the transcendental nature of the relevant equations, but a useful approximate solution can 
be obtained by a simple analytic argument which will now be discussed. Sheath physics is 
of particular importance for interpreting the behavior of Langmuir probes which are small 
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metal wires used to diagnose low temperature plasmas. Biasing a Langmuir probe at a se- 
quence of voltages and then measuring the resulting current provides a simple way to gauge 
both the plasma density and the electron temperature. 


Langmuir probe 
(or metal wall) 


potential with convex curvature 


plasma 


® probe 


-o0<x<0 x=0 


Figure 2.6: Sketch of sheath. Ions are accelerated in sheath to probe (wall) at x = 0 whereas 
electrons are repeled. Convex curvature of sheath requires n;(x) to always be greater than 
Ne (x). 


The model presented here is the simplest possible model for sheaths and Langmuir 
probes and is one-dimensional. The geometry, sketched in Fig.2.6, idealizes the Langmuir 
probe as a metal wall located at x = 0 and biased to a potential ¢,,,.,4¢.; this geometry 
could also be used to describe an actual biased metal wall at x = O in a two-dimensional 
plasma. The plasma is assumed to be collisionless and unmagnetized and to have an am- 
bipolar potential plasma Which differs from the laboratory reference potential (so-called 
ground potential) because of a difference in the diffusion rates of electrons and ions out 
of the plasma. The plasma is assumed to extend into the semi-infinite left-hand half-plane 
—00 < T < 0. If probe = Êplasma» then neither electrons nor ions will be accelerated or 
decelerated on leaving the plasma and so each species will strike the probe (or wall) at a 
rate given by its respective thermal velocity. Since me << m;, the electron thermal veloc- 
ity greatly exceeds the ion thermal velocity. Thus, for probe = Ppiasma the electron flux 
to the probe (or wall) greatly exceeds the ion flux and so the current collected by the probe 
(or wall) will be negative. 

Now consider what happens to this electron flow if the probe (or wall) is biased negative 
with respect to the plasma as shown in Fig.2.6. To simplify the notation, a bar will be used 
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to denote a potential measured relative to the plasma potential, i.e., 


(x) = e(z) E Pplasma: (2.106) 


The bias potential imposed on the probe (or wall) will be shielded out by the plasma within 
a distance of the order of the Debye length; this region is the sheath. The relative potential 
@(a) varies within the sheath and has the two limiting behaviors: 


im d(x) = Pprobe =, Pplasma 
lim ¢(z) = 0. (2.107) 
|z|>>ApD 


Inside the plasma, i.e., for |x| >> Ap, it is assumed that the electron distribution function is 
Maxwellian with temperature Te. Since the distribution function depends only on constants 
of the motion, the one-dimensional electron velocity distribution function must depend only 
on the electron energy mv?/2 + q-@(zx), a constant of the motion, and so must be of the 


form 5 
2 
no mv ee) ) 
e(U,2) = ex (2.108) 
ae \/T2KT, /Me “3 ( ( KT. 
in order to be Maxwellian when z >> Ap. 
The electron density is 


CO 

ne(x) = / du fe(v,0) = ngewe P(™)/K Fe (2.109) 
<00 

When the probe is biased negative with respect to the plasma, only those electrons with 

sufficient energy to overcome the negative potential barrier will be collected by the probe. 

The ion dynamics is not a mirror image of the electron dynamics. This is because a 
repulsive potential prevents passage of particles having insufficient initial energy to climb 
over a potential barrier whereas an attractive potential allows passage of all particles en- 
tering a region of depressed potential. Particle density is reduced compared to the inlet 
density for both repulsive and attractive potentials but for different reasons. As shown in 
Eq.(2.109) a repulsive potential reduces the electron density exponentially (this is essen- 
tially the Boltzmann analysis developed in the theory of Debye shielding). Suppose the 
ions are cold and enter a region of attractive potential with velocity uo. Flux conservation 
shows that nouo = n;(x)u;(x) and since the ions accelerate to higher velocity when falling 
down the attractive potential, the ion density must also decrease. Thus the electron den- 
sity scales as exp(— |\geo| /KTe) and so decreases upon approaching the wall in response 
to what is a repulsive potential for electrons whereas the ion density scales as 1/u;(a) and 
also decreases upon approaching the wall in response to what is an attractive potential for 
ions. 

Suppose the probe is biased negatively with respect to the plasma. Since quasi-neutrality 
within the plasma mandates that the electric field must vanish inside the plasma, the po- 
tential must have a downward slope on going from the plasma to the probe and the deriv- 
ative of this slope must also be downward. This means that the potential @ must have a 
convex curvature and a negative second derivative as indicated in Fig.2.6. However, the 
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one-dimensional Poisson’s equation 
do e 
dx? E0 


(ni(£) — ne(x)) (2.110) 


shows that in order for ¢ to have convex curvature, it is necessary to have n;(x) > n(x) 
everywhere. This condition will now be used to estimate the inflow velocity of the ions at 
the location where they enter the sheath from the bulk plasma. 

Ion energy conservation gives 


1 = 1 
gi (x) + ed(x) = ait (2.111) 


u(x) = Jug — 2ed(x)/mj. (2.112) 


Using the ion flux conservation relation nouo = 7;(x)u;(a) the local ion density is found 
to be 


which can be solved to give 


x)= = (2.113) 
(1 — 2e@(x) /mus) ia l 


The convexity requirement n; (£) > n-(x) implies 


(1 — 2€6(x)/mju) Y? > eP0/sT:, (2.114) 
Bearing in mind that (x) is negative, this can be re-arranged as 
2e|d TET 
1+ Pelee) < 2elA(a)|/KTe (2.115) 
MUD 
or 7 E E i E é 
2eld(a)| — 2elo(a)| , 1 (2€lo@)I\" 1 (2€l@@)) =" 
z < - H f H ; eee (2.116) 
muĝ KT. 2 KTe 3! KT. 
which can only be satisfied for arbitrary |(a)| if 
ug > 2KT./m, . (2.117) 


Thus, in order to be consistent with the assumption that the probe is more negative than 
the plasma to keep d?@/dx? negative and hence # convex, it is necessary to have the 
ions enter the region of non-neutrality with a velocity slightly larger than the so-called “ion 
acoustic” velocity cy = \/KT./m;. 
The ion current collected by the probe is given by the ion flux times the probe area, i.e., 
TI, = nouogA 
= nocseA. (2.118) 


The electron current density incident on the probe is 
J duqev felv, 0) 
0 
Nogee” 19(2)/#Te 0o. 
V T2KTe/Me 0 


T. a 
= noge | ——e 1 @)I/8Te (2.119) 
27Me 


J.(x) 


pera 
du ve ™? IIR Te 
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and so the electron current collected at the probe is 


i 
(ee ey ne Me (2.120) 


Thus, the combined electron and ion current collected by the probe is 


I= G+ (2.121) 


Te TEN 
Rie oelle) /sTe. 


= noc,eA — noeA 
27Me 


The electron and ion currents cancel each other when 


aes =/ Kle -elb(a)|/nTe (2.122) 
Mi 27Me 
i.e., when 
a Mi 
€|dprobel/KTe a Anm 
= 2.5 for hydrogen. (2.123) 


This can be expressed as 


KT | Mi 
Pprobe = Opiasma e ln ATMe (2.124) 


and shows that when the probe potential is more negative than the plasma potential by 
an amount T, ln/m;/4nmMme where T, is expressed in electron volts, then no current 
flows to the probe. This potential is called the floating potential, because an insulated 
object immersed in the plasma will always charge up until it reaches the floating potential 
at which point no net current flows to the object. 

These relationships can be used as simple diagnostic for the plasma density and tem- 
perature. If a probe is biased with a large negative potential, then no electrons are collected 
but an ion flux is collected. The collected current is called the ion saturation current and 


is given by Isat = NnocseA. The ion saturation current is then subtracted from all subse- 
1/2 


quent measurements giving the electron current J. = I — Isat = noeA (KTe/2rmMe) 
exp (—e|ġ(x)|/KTe) . The slope of a logarithmic plot of Ie versus @ gives 1/«T. and can 
be used to measure the electron temperature. Once the electron temperature is known, Cs 
can be calculated. The plasma density can then be calculated from the ion saturation cur- 
rent measurement and knowledge of the probe area. Langmuir probe measurements are 
simple to implement but are not very precise, typically having an uncertainty of a factor of 
two or more. 
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2.8 Assignments 


Prove Stirling’s formula. To do this first show 


In N! 


Inl4+1In2+1n3+... InN 
N 

S inj 

j=l 


Now assume JN is large and, using a graphical argument, show that the above expres- 
sion can be expressed as an integral 


? 


Invi | h(x)dz 
? 


where the form of h(x) and the limits of integration are to be provided. Evaluate the 
integral and obtain Stirling’s formula 


InN! ~ NmN-N for large N 


which is a way of calculating the values of factorials of large numbers. Check the 
accuracy of Stirling’s formula by evaluating the left and right hand sides of Stirling’s 
formula numerically and plot the results for N = 1, 10, 100, 1000, 104 and higher if 
possible. 


Variational calculus and Lagrange multipliers- The entropy associated with a distrib- 
ution function is 


S= f(v) In f(v) dv. 
(a) Since f(v) measures the number of particles that have velocity v, use physical 
arguments to explain what value f(+:oo) must have. 


(b 


wm 


Suppose that fm 2(v) is the distribution function having the maximum entropy 
out of all distribution functions allowed for the problem at hand. Let 6f(v) be 
some small arbitrary deviation from fm p (v). What is the entropy S + ôS asso- 
ciated with the distribution function fm 2(v) + 6f(v)? What is the differential 
of entropy ôS between the two situations? 


(c) Each particle in the distribution function has a kinetic energy mv2/2 and sup- 
pose that there are no external forces acting on the system of particles so that 
the potential energy of each particle is zero. Let Æ be the total energy of all 
the particles. How does Æ depend on the distribution function? If the system is 
isolated and the system changes from having a distribution function f mpg (v) to 
having the distribution function fmg(v) + 6f(v) what is the change in energy 
OE between these two cases? 


(d) By now you should have integral expressions for ôS and for ôE. Both of these 
integrals should have what value? Make a rough sketch showing any possi- 
ble, nontrivial v dependence of the integrands of these expressions (i.e., show 
whether the integrands are positive definite or not). 


(e) Since 6f was assumed to be arbitrary, what can you say about the ratio of the 
integrand in the expression for ôS to the integrand in the expression for 6? Is 
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this ratio constant or not (explain your answer)? Let this ratio be denoted by 
(this is called a Lagrange multiplier). 


(£) Show that the conclusion reached in ‘e’ above lead to the f mp (v) having to be 
a Maxwellian. 


. Suppose that a group of N particles with charge gq, and mass me are located in an 
electrostatic potential ¢(x). What is the maximum entropy distribution function for 
this situation (give a derivation)? 


. Prove that 


J dre” =,/Z (2.125) 
a 
Hint: Consider the integral 


/ ie dye! =| / dxdy e (+97) 


and note that dzdy is an element of area. Then instead of using Cartesian coordinates 
for the integral over area, use cylindrical coordinates and express all quantities in the 
double integral in cylindrical coordinates. 


Q 2 ee 2 
J dg xe" %* 3 dg grtet”, 
=o = 


Hint: Take the derivative of both sides of Eq.(2.125) with respect to a. 


. Evaluate the integrals 


. Suppose that a particle starts at time t = to with velocity vo at location xo and is 
located in a uniform, constant magnetic field B =B2. There is no electric field. Cal- 
culate its position and velocity as a function of time. Make sure that your solution 
satisfies the initial conditions on both velocity and position. Be careful to treat mo- 
tion parallel to the magnetic field as well as perpendicular. Express your answer in 
vector form as much as possible; use the subscripts ||, -L to denote directions parallel 
and perpendicular to the magnetic field, and use we = qB /m to denote the cyclotron 
frequency. Show that f (xo) is a solution of the Vlasov equation. 


. Thermal force (Braginskii 1965)- If there is a temperature gradient then because of 
the temperature dependence of collisions, there turns out to be an additional subtle 
drag force proportional to VT. To find this force, suppose a temperature gradient 
exists in the x direction, and consider the frictional drag on electrons passing a point 
x = xo, The electrons moving to the right (positive velocity) at xo have travelled 
without collision from the point £o — lm fp, Where the temperature was T (£o — lm fp), 
while those moving to the left (negative velocity) will have come collisionlessly from 
the point £o+lm fp where the temperature is T (£o+lm fp). Suppose that both electrons 
and ions have no mean velocity at xo; i.€., Ue = u; = 0. Show that the total drag force 
on all the electrons at £o is 


Rihma = —2mMeNelm tp r (VeiV Te) : 
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Normalize the collision frequency, thermal velocity, and mean free paths to their 
values at x = zo where T = Tọ; e.g. vre(T) = vreo(T/To)!/?. By writing 
0/0x = (OT /0x) 0/OT and using these normalized values show that 


Rihermal = —2NeKkV Te. 


A more accurate treatment which does a proper averaging over velocities gives 


Rihermal = —0.71neKkV Te. 


. MHD with neutrals- Suppose a plasma is partially (perhaps weakly) ionized so that 


besides moment equations for ions and electrons there will also be moment equations 
for neutrals. Now the constraints will be different since ionization and recombination 
will genuinely produce creation of plasma particles and also of neutrals. Construct a 
set of constraint equations on the collision operators which will now include ionization 
and recombination as well as scattering. Take the zeroth and first moment of the 
three Vlasov equations for ions, electrons, and neutrals and show that the continuity 
equation is formally the same as before, i.e., 


dp 
£ +V-(pU)=0 
ar + V (PU) 


providing p refers to the total mass density of the entire fluid (electrons, ions and 
neutrals) and U refers to the center of mass velocity of the entire fluid. Show also that 
the equation of motion is formally the same as before, provided the pressure refers to 
the pressure of the entire configuration: 


DU 

P De 

Show that Ohm’s law will be the same as before, providing the plasma is sufficiently 
collisional so that the Hall term can be dropped, and so Ohm’s law is 


=-—-VP +J xB. 


E +U x B =J 


Explain how the neutral component of the plasma gets accelerated by the J x B force 
— this must happen since the inertial part of the equation of motion (i.e., oDU/Dt) 
includes the acceleration on neutrals. Assume that electron temperature gradients are 
parallel to electron density gradients so that the electro-thermal force can be ignored. 


. MHD Heat Transport Equation- Define the MHD N—dimensional pressure 


1 
P= me [vv fea 
where v’ = v — U and N is the number of dimensions of motion (e.g., if only motion 


in one dimensions is considered, then N = 1, and v, U are one dimensional, etc.). 
Also define the isotropic MHD heat flux 


v’? 
a= ome f vany 
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(i) By taking the second moment of the Vlasov equation for each species (i.e., use 
v? /2) and summing over species show that 
NDP N+2 


Hints: 
(a) Prove that U- (mo ES v'v' fod’ v) = PU assuming that f, is isotropic. 
(b) What happens to X, Mmo fv?Coa fod’ v? 


(c) Prove using the momentum and continuity equations that 


a (=) Lv. (=v) =-U-VP+U-(J x B). 


ot \ 2 2 


(ii) Using the continuity equation and Ohm’s law show that 


NDP N+2PDp _ 
IDF 2 pDt 


Show that if both the heat flux term —V - q and the Ohmic heating term nJ? can be 
ignored, then the pressure and density are related by the adiabatic condition P ~ p7 
where y = (N + 2)/N. By assuming that D/Dt ~ w and that V ~ k show that the 
dropping of these two right hand terms is equivalent to assuming that w >> Vei and 
that w/k >> vr, Explain why the phenomenon should be isothermal if w/k << vr., 


V-qtnJ? 


10. Sketch the current collected by a Langmuir probe as a function of the bias voltage 
and indicate the ion saturation current, the exponentially changing electron current, 
the floating potential, and the plasma potential. Calculate the ion saturation current 
collected by a 1 cm long, 0.25 mm diameter probe immersed in a 5 eV argon plasma 
which has a density n = 10! m~?. Calculate the electron saturation current also (i.e., 
the current when the probe is at the plasma potential). What is the offset of the floating 
potential relative to the plasma potential? 


Motion of a single plasma particle 


3.1 Motivation 


As discussed in the previous chapter, Maxwellian distributions result when collisions have 
maximized the local entropy. Since collisions occur infrequently in hot plasmas, many im- 
portant phenomena have time scales much shorter than the time required for the plasma 
to relax to a Maxwellian. A collisionless model of the plasma is thus required to charac- 
terize these fast phenomena. Since randomization does not occur in collisionless plasmas, 
entropy is conserved and the distribution function is typically non-Maxwellian. Such a 
plasma is not in thermodynamic equilibrium and so thermodynamic concepts do not in 
general apply. 

In Sect.2.2 it was shown that any function constructed from constants of the particle 
motion is a solution of the collisionless Vlasov equation. It is therefore worthwhile to 
develop a ‘repertoire’ of constants of the motion which can then be used to construct solu- 
tions to the Vlasov equation appropriate for various circumstances. Furthermore, the study 
of single particle motion is a worthy endeavor because it: 

(1) develops valuable intuition, 

(ii) highlights unusual situations requiring special treatment, 

(iii) gives valuable insight into fluid motion. 


3.2 Hamilton-Lagrange formalism v. Lorentz equation 


Two mathematically equivalent formalisms describe charged particle dynamics; these are 
(i) the Lorentz equation 


ma =q(E +v x B) (3.1) 
dt 
and (ii) Hamiltonian-Lagrangian dynamics. 

The two formalisms are complementary: the Lorentz equation is intuitive and suitable 
for approximate methods, whereas the more abstract Hamiltonian-Lagrange formalism ex- 
ploits time and space symmetries. A brief review of the Hamiltonian-Lagrangian formalism 
follows, emphasizing aspects relevant to dynamics of charged particles. 

The starting point is to postulate the existence of a function L, called the Lagrangian, 
which: 
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1. contains all information about the particle dynamics for a given situation, 


2. depends only on generalized coordinates Q;(t),Q;(t) appropriate to the problem, 


3. possibly has an explicit dependence on time t. 
If such a function L(Q;(t), Qi (t), t) exists, then information on particle dynamics is 
retrieved by manipulation of the action integral 


S= T * L(Qi(t), Qi (0), Bat. (3.2) 


ty 

This manipulation is based on d’Alembert’s principle of least action. According to this 
principle, one considers the infinity of possible trajectories a particle could follow to get 
from its initial position Q;(t,) to its final position Q;(t2), and postulates that the trajectory 
actually followed is the one which results in the least value of S. Thus, the value of S 
must be minimized (note that S here is action, and not entropy as in the previous chapter). 
Minimizing S does not give the actual trajectory directly, but rather gives equations of 
motion which can be solved to give the actual trajectory. 

Minimizing S is accomplished by considering an arbitrary nearby alternative trajectory 
Q(t) + 6Q;(t) having the same beginning and end points as the actual trajectory, i.e., 
6Q;(t1) = 6Q;(t2) = 0. In order to make the variational argument more precise, 6Q; is 
expressed as 

5Qi(t) = €n,(t) (3.3) 
where € is an arbitrarily adjustable scalar assumed to be small so that €? < € and 77,(t) is a 
function of t which vanishes when t = tı or t = tə but is otherwise arbitrary. Calculating 
6S to second order in € gives 


ôS = f Q os 6Qi, Qi aE Hiat- f 1(Q4.Q:,1at 


tı 


= f uQ + en, Qi + ay, that =f” IQQ tat 


ti 


t2 2 92 >22 92 
| (or OF y AA er eae iets) 2 =) dt. (3.4) 
A AQ: 2 0Q; AQ: 2 aQ; 
Suppose that the trajectory Q;(t) is the one that minimizes S. Any other trajectory must 
lead to a higher value of S and so ôS must be positive for any finite value of e. If € is 
chosen to be sufficiently small, then the absolute values of the terms of order e€? in Eq.(3.4) 
will be smaller than the absolute values of the terms linear in e. The sign of € could then 
be chosen to make ôS negative, but this would violate the requirement that ôS must be 
positive. The only way out of this dilemma is to insist that the sum of the terms linear in € 
in Eq.(3.4) vanishes so that 6S ~ e? and is therefore always positive as required. Insisting 
that the sum of terms linear in € vanishes implies 


t2 OL OL 
0= ia t+ ț ) et. (3.5) 
I (n OQ: 2 a 


Using 1); = dn;/dt the above expression may be integrated by parts to obtain 


aaa 
ti "AQ, dt AQ; 


OL |” t2 OL d (OL 
= Picts l , i dt. ‘ 
h a i A J, fn aQi "a ta } i Se 


0 
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Since n; (t1,2) = 0 the integrated term vanishes and since 7, was an arbitrary function of t, 
the coefficient of 7; in the integrand must vanish, yielding Lagrange’s equation 


dP; OL 
a sI: 
dt 0Q; G2) 
where the canonical momentum P, is defined as 
OL 
P; = —. (3.8) 
Qi 


Equation (3.7) shows that if L does not depend on a particular generalized coordinate 
Q; then dP; /dt = 0 in which case the canonical momentum P; is a constant of the motion; 
the coordinate Q; is called a cyclic or ignorable coordinate. This is a very powerful and 
profound result. Saying that the Lagrangian function does not depend on a coordinate 
is equivalent to saying that the system is symmetric in that coordinate or translationally 
invariant with respect to that coordinate. The quantities P; and Q; are called conjugate 
and action has the dimensions of the product of these quantities. 

Hamilton extended this formalism by introducing a new function related to the La- 
grangian. This new function, called the Hamiltonian, provides further useful information 


and is defined as 
= (= ra) = (3.9) 


t 
Partial derivatives of H with respect to P; and to Q; give Hamilton’s equations 


24), ge (3.10) 
‘OP; ‘AQ; Í 
which are equations of motion having a close relation to phase-space concepts. The time 
derivative of the Hamiltonian is 


dH qh | dQ; OL aL dQ; l _ OL 
dt £ a Ot Pae 2- Q- 230; dt ` d at” 
(3.11) 

This shows that if L does not explicitly depend on time, i.e., 0L/Ot = 0, the Hamiltonian 
H is a constant of the motion. As will be shown later, H corresponds to the energy of the 
system, so if OL /Ot = 0, the energy is a constant of the motion. Thus, energy is conjugate 
to time in analogy to canonical momentum being conjugate to position (note that energy x 
time also has the units of action). If the Lagrangian does not explicitly depend on time, then 
the system can be thought of as being ‘symmetric’ with respect to time, or “translationally’ 
invariant with respect to time. 

The Lagrangian for a charged particle in an electromagnetic field is 


2 
L = "> +qv- A(x,t) — g0(x,0); (3.12) 


the validity of Eq.(3.12) will now be established by showing that it generates the Lorentz 
equation when inserted into Lagrange’s equation. Since no symmetry is assumed, there is 
no reason to use any special coordinate system and so ordinary Cartesian coordinates will 
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be used as the canonical coordinates in which case Eq.(3.8) gives the canonical momentum 
as 


P =mv + qA (x,t). (3.13) 
The left hand side of Eq.(3.7) becomes 


dP dv OA 
= tv-VA 14 
de at’! ( a ) ee 
while the right hand side of Eq.(3.7) becomes 


OL 


ae qv (v-A)—qVd=q(v:-VA+4+vxV x A) —qV¢ 


q(v:-VA+v x B) —qV¢. 


(3.15) 


Equating the above two expressions gives the Lorentz equation where the electric field is 
defined as E = —OA/0t — Vo in accord with Faraday’s law. This proves that Eq.(3.12) 
is mathematically equivalent to the Lorentz equation when used with the principle of least 
action. 

The Hamiltonian associated with this Lagrangian is, in Cartesian coordinates, 


H = P-v-L 
mv? 
DAE i 
P—gA 2 
a TAG) - qo(x.t) (3.16) 


where the last line is the form more suitable for use with Hamilton’s equations, i.e., H = 
H(x,P,t). Equation (3.16) also shows that H is, as promised, the particle energy. If 
generalized coordinates are used, the energy can be written in a general form as E = 
H(Q, P,t). Equation (3.11) showed that even though both Q and P depend on time, the 
energy depends on time only if H explicitly depends on time. Thus, in a situation where H 
does not explicitly depend on time, the energy would have the form E = H(Q(t), P(t)) = 
const. 

It is important to realize that both canonical momentum and energy depend on the 
reference frame. For example, a bullet fired in an airplane in the direction opposite to 
the airplane motion and with a speed equal to the airplane’s speed, has a large energy as 
measured in the airplane frame, but zero energy as measured by an observer on the ground. 
A more subtle example (of importance to later analysis of waves and Landau damping) 
occurs when A and/or ¢ have a wave-like dependence, e.g. (x,t) = g(x — Vpnt) where 
Vpn is the wave phase velocity. This potential is time-dependent in the lab frame and 
so the associated Lagrangian has an explicit dependence on time in the lab frame, which 
implies that energy is not a constant of the motion in the lab frame. In contrast, ¢ is time- 
independent in the wave frame and so the energy is a constant of the motion in the wave 
frame. Existence of a constant of the motion reduces the complexity of the system of 
equations and typically makes it possible to integrate at least one equation in closed form. 
Thus it is advantageous to analyze the system in the frame having the most constants of the 
motion. 
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3.3 Adiabatic invariant of a pendulum 


Perfect symmetry is never attained in reality. This leads to the practical question of how 
constants of the motion behave when space and/or time symmetries are ‘good’, but not 
perfect. Does the utility of constants of the motion collapse abruptly when the slightest 
non-symmetrical blemish rears its ugly head, does the utility decay gracefully, or does 
something completely different happen? To answer these questions, we begin by consid- 
ering the problem of a small-amplitude pendulum having a time-dependent, but slowly 
changing resonant frequency w(t). Since w? = g/L, the time-dependence of the frequency 
might result from either a slow change in the gravitational acceleration g or else from a 
slow change in the pendulum length l. In both cases the pendulum equation of motion will 
be ` 
a +w?’ (t)z =0. (3.17) 
This equation cannot be solved exactly for arbitrary w(t) but for if a modest restriction is put 
on w(t) the equation can be solved approximately using the WKB method (Wentzel 1926, 
Kramers 1926, Brillouin 1926). This method is based on the hypothesis that the solution 
for a time-dependent frequency is likely to be a generalization of the constant-frequency 
solution 

x = Re [A exp(iwt)], (3.18) 
where this generalization is guessed to be of the form 


a(t) = Re EOG F sose] i (3.19) 


Here A(t) is an amplitude function determined as follows: calculate the first derivative of 
Eq. (3.19), 


d : t t 1 dA . t 1 1 
= = Re fiva! ojd aot ol a l (3.20) 
then the second derivative 
d? d dA PAN ore Hiv 
a = Re (a Fiw E -At ) el ot a| ; (3.21) 


and insert this last result into Eq. (3.17) which reduces to 


dw dA dA 
i— A 4 2i =0 
“dt at 
since the terms involving w? cancel exactly. To proceed further, we make an assumption 
— the validity of which is to be checked later — that the time dependence of dA/d¢t is 
sufficiently slow to allow dropping the last term in Eq. (3.22) relative to the middle term. 
The two terms that remain in Eq. (3.22) can then be re-arranged as 


(3.22) 


1 dw 2dA 

ee oi ee 3.23 

w dt A dt ( ) 
which has the exact solution i 

A(t) ~ (3.24) 
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The assumption of slowness is thus at least self-consistent, for if w(t) is indeed slowly 
changing, Eq.(3.24) shows that A(t) will also be slowly changing and the dropping of the 
last term in Eq.(3.22) is justified. The slowness requirement can be quantified by assuming 
that the frequency has an exponential dependence 


w(t) = woe. (3.25) 
Thus 


t 
is a measure of how fast the frequency is changing compared to the frequency itself. Hence 
dropping the last term in Eq.(3.22) is legitimate if 


a << dw (3.27) 


or 1 d 
Bee 2S Ag (3.28) 


In other words, if Eq.(3.28) is satisfied, then the fractional change of the pendulum period 
per period is small. 

Equation (3.24) indicates that when w is time-dependent, the pendulum amplitude is not 
constant and so the pendulum energy is not conserved. It turns out that what is conserved 
is the action integral 


S = Q vdr (3.29) 


where the integration is over one period of oscillation. This integral can also be written in 


terms of time as 
totT dz 
S= f v—dt (3.30) 
to dt 


where to is a time when z is at an instantaneous maximum and 7, the period of a complete 
cycle, is defined as the interval between two successive times when da/dt = 0 and d?x/dt? 
has the same sign (e.g., for a pendulum, to would be a time when the pendulum has swung 
all the way to the right and so is reversing its velocity while 7 is the time one has to wait 
for this to happen again). To show that action is conserved, Eq. (3.29) can be integrated by 


parts as 
“+r fa / dz dx Pa 
x x 
to dt dt dt? 


da] 7 ee ax 
x6 — x—dt 
dt A dt? 


to 


totT ; 
= i wx dt (3.31) 
to 


U 
II 


where (i) the integrated term has vanished by virtue of the definitions of tọ and 7,and (ii) 
Eq.(3.17) has been used to substitute for d?/dt?. Equations (3.19) and (3.24) can be 
combined to give 


w(t) 


By ay OO eos (f (ear) (3.32) 
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so Eq.(3.31) becomes 


S = fae {ato cos (ff, wle “dt bw 


= [x(to)]? w(to) o w(t’) cos H w(t at” )at' (3.33) 


2T 

= [x(to)]” (to) f dé cos ?E = x [x(to)]? w(to) = const. 
0 

where € = i w(t" )dt" and dé = w(t')dt’. Equation (3.29) shows that S is the area in 

phase-space enclosed by the trajectory {x(t), v(t) } and Eq.(3.33) shows that for a slowly 

changing pendulum frequency, this area is a constant of the motion. Since the average 

energy of the pendulum scales as ~fw(t)x(t)]? , we see from Eq.(3.24) that the ratio 


pee Py w(t)a?(t) ~ S ~ const. (3.34) 

frequency 

The ratio in Eq.(3.34) is the classical equivalent of the quantum number NV of a simple 

harmonic oscillator because in quantum mechanics the energy Æ of a simple harmonic 
oscillator is related to the frequency by the relation E /hw = N + 1/2. 

This analysis clearly applies to any dynamical system having an equation of motion of 

the form of Eq.(3.17). Hence, if the dynamics of plasma particles happens to be of this 
form, then S' can be added to our repertoire of constants of the motion. 


3.4 Extension of WKB method to general adiabatic 
invariant 


Action has the dimensions of (canonical momentum) x (canonical coordinate) so we may 
anticipate that for general Hamiltonian systems, the action integral given in Eq.(3.29) is not 
an invariant because v is not, in general, proportional to P. We postulate that the general 
form for the action integral is 


S= $ PaQ (3.35) 


where the integral is over one period of the periodic motion and P,Q are the relevant 
canonical momentum-coordinate conjugate pair. The proof of adiabatic invariance used for 
Eq.(3.29) does not work directly for Eq.(3.35); we now present a slightly more involved 
proof to show that Eq.(3.35) is indeed the more general form of adiabatic invariant. 

Let us define the radius vector in the Q — P plane to be R = (Q, P) and define unit 
vectors in the Q and P directions by Q and Ê; these definitions are shown in Fig.3.1. Fur- 
thermore we define the z direction as being normal to the Q — P plane; thus the unit vector 
2 is ‘out of the paper’, i.e., 2 = Q x P. Hamilton’s equations [i.e., P = —OH/0Q, Q= 
OH/OP] may be written in vector form as 

dR 
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where 5 3 

V= QO + Pop (3.37) 
is the gradient operator in the Q — P plane. Equation (3.36) shows that the phase-space 
‘velocity’ dR/dt is orthogonal to V H. Hence, R stays on a level contour of H. If H is 
constant, then, in order for the motion to be periodic, the path along this level contour must 
circle around and join itself, like a road of constant elevation around the rim of a mountain 
(or a crater). If H is not constant, but slowly changing in time, the contour will circle 
around and nearly join itself. 


P P 
ô 
TS. 
constant H 
R= (Q, P) contour 
Q 


Figure 3.1: Q-P plane 


Equation (3.36) can be inverted by crossing it with Z to give 


dR 

dt ` 
For periodic and near-periodic motions, dR,/dt is always in the same sense (always clock- 
wise or always counterclockwise). Thus, Eq. (3.38) shows that an “observer” following 
the path would always see that H is increasing on the left hand side of the path and de- 
creasing on the right hand side (or vice versa). For clarity, the origin of the Q — P plane 
is re-defined to be at a local maximum or minimum of H. Hence, near the extremum H 

must have the Taylor expansion 


VH =2x (3.38) 


2792 
Z Ee (3.39) 


2 [8H 
H(P,Q) = extremum + 9 Q E 


e P=0,Q=0 2 zF] P=0,Q=0 
where [3H/ð3P?] P=0,Q=0 and [3 H/0Q?] P=0,9=0 ate either both positive (valley) or 
both negative (hill). Since H is assumed to have a slow dependence on time, these second 
derivatives will be time-dependent so that Eq.(3.39) has the form 


H =at) — +e) (3.40) 
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where a(t) and {(t) have the same sign. The term Hertremum in Eq.(3.39) has been 
dropped because it is just an additive constant to the energy and does not affect Hamilton’s 
equations. From Eq. (3.36) the direction of rotation of R is seen to be counterclockwise if 
the extremum of H is a hill, and clockwise if a valley. 

Hamilton’s equations operating on Eq.(3.40) give 
dP dQ 
—_ = — =aP. Al 
wl Ge =e 6.41) 
These equations do not directly generate the simple harmonic oscillator equation because 
of the time dependence of a, 3. However, if we define the auxiliary variable 


t 
re B(t')dt! (3.42) 
then 
d drd ¿d 
dt dtdr “dr 
so Eq.(3.41) becomes 
dP dQ a 
= = P. 3.43 
dr Q, dr (3:43) 


Taking the 7 derivative of the left equation above, and substituting the right hand equation 
gives 

P a 

—+s5P=0 3.44 

dr? ý B Oe) 
which now is a simple harmonic oscillator with w?(T) = a(r)/8(7). The action integral 
may be rewritten as 


a= | pea, (3.45) 
dT 


where the integral is over one period of the motion. Using Eqs.(3.43) and following the 
same procedure as was used with Eqs.(3.32) and Eq.(3.33), this becomes 


P A )@ om 


where A is a constant dependent on initial conditions. By introducing the orbit phase ¢ = 
J” (a/b)! aT, Eq.(3.46) becomes 


Ir 
S=) J. docos?ġ = const. (3.47) 
0 


Thus, the general action integral is indeed an adiabatic invariant. This proof is of course 
only valid in the vicinity of an extremum of H, i.e., only where H can be adequately 
represented by Eq.(3.40). 


3.4.1 General Proof for the General Adiabatic Invariant 


We now develop a proof for the general adiabatic invariant. This proof is not restricted 
to small oscillations (i.e., being near an extremum of H) as was the previous discussion. 
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Let the Hamiltonian depend on time via a slowly changing parameter A(t), so that H = 
A(P,Q, X(t)). From Eq.(3.16) the energy is given by 
E(t) = A(P,Q, A(t) (3.48) 


and, in principle, this relation can be inverted to give P = P(E(t), Q, A(t)). Suppose a 
particle is executing nearly periodic motion in the Q — P plane. We define the turning 
point Qip as a position where dQ/dt = 0. Since Q is oscillating there will be a turning 
point associated with Q having its maximum value and a turning point associated with Q 
having its minimum value. From now on let us only consider turning points where Q has 
its maximum value, that is we only consider the turning points on the right hand side of the 
nearly periodic trajectories in the Q — P plane shown in Fig.3.2. 


P 


I 
| 


Oy 


Figure 3.2: Nearly periodic phase space trajectory for slowly changing Hamiltonian. The 
turning Qip(t) point is where Q is at its maximum. 


If the motion is periodic, then the turning point for the N + 1°” period will be the same 


as the turning point for the N“” period, but if the motion is only nearly periodic, there will 
be a slight difference as shown in Fig.3.2. This difference can be characterized by making 
the turning point a function of time so Qip = Qip(t). This function is only defined for the 
times when dQ/dt = 0. When the motion is not exactly periodic, this turning point is such 
that Qip(t + T) Æ Qip(t) where 7 is the time interval required for the particle to go from 
the first turning point to the next turning point. The action integral is over one entire period 
of oscillation starting from a right hand turning point and then going to the next right hand 
turning point (cf. Fig. 3.2) and so can be written as 


f PAQ 


Qep(t+T) 
f PaQ. (3.49) 
Qtp (t) 


U 
II 
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From Eq.(3.16) it is seen that P/m is not, in general, the velocity and so the velocity 
dQ/d¢ is not, in general, proportional to P. Thus, the turning points are not necessarily 
at the locations where P vanishes, and in fact P need not change sign during a period. 
However, S' still corresponds to the area of phase-space enclosed by one period of the 
phase-space trajectory. 

We can now calculate 


S Qtp(t+T) 
= = ` f PaQ = 2 n P(E(t),Q, X(t))aQ 


Qtp(t) 
Qtp(t+7) Qep(t+T) 
d p OP 
z pR] z / (=) dQ (3.50) 
Qtp(t) Qtp(t) Q 
7 i. (=) dE (+) dà ag 
Qup(t) OE Qn dt Or QE dt 


Because dQ/dt = Oat the turning point, the integrated term vanishes and so there is no 
contribution from motion of the turning point. From Eq.(3.48) we have 


ax OH (3) (3.51) 
OP \ OE QA i 
and 
OH (2) OH 
0= } (3.52) 
OP \ OX QE Or 
so that Eq.(3.50) becomes 
ds ƏH\ [dE OHAA 
dt f (3) E aN Z| a eo?) 
From Eq.(3.48) we have 


dE OHAP . dHAQ ƏHdA _ OHA 
dt OP dt ` ðQ dt ` ðA dt A dt 


H (3.54) 
since the first two terms cancelled due to Hamilton’s equations. Substitution of Eq.(3.54) 
into Eq.(3.53) gives dS/dt = 0, completing the proof of adiabatic invariance. No assump- 
tion has been made here that P, Q are close to the values associated with an extremum of 
A. 

This proof seems too neat, because it has established adiabatic invariance simply by 
careful use of the chain rule, and by taking partial derivatives. However, this observation 
reveals the underlying essence of adiabaticity, namely it is the differentiability of H, P 
with respect to A from one period to the next and the Hamilton nature of the system which 
together provide the conditions for the adiabatic invariant to exist. If the motion had been 
such that after one cycle the motion had changed so drastically that taking a derivative of H 
or P with respect to A would not make sense, then the adiabatic invariant would not exist. 
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3.5 Drift equations 


We show in this section that it is possible to deduce intuitive and quite accurate analytic 
solutions for the velocity (drift) of charged particles in arbitrarily complicated electric and 
magnetic fields provided the fields are slowly changing in both space and time (this re- 
quirement is essentially the slowness requirement for adiabatic invariance). Drift solutions 
are obtained by solving the Lorentz equation 
dv 

my (E +v xB) (3.55) 
iteratively, taking advantage of the assumed separation of scales between fast and slow 
motions. 


3.5.1 Simple E x B and force drifts 


Before developing the general method for analyzing drifts, a simple example illustrating 
the basic idea will now be discussed. This example consists of an ion starting at rest in a 
spatially uniform magnetic field B =B2 and a spatially uniform electric field E = Ey. 
The origin is defined to be at the ion’s starting position and both electric and magnetic fields 
are constant in time. The assumed spatial uniformity and time-independence of the fields 
represent the extreme limit of assuming that the fields are slowly changing in space and 
time. 

Because the magnetic force qv x B is perpendicular to v, the magnetic force does no 
work and so only the electric field can change the ion’s energy (this can be seen by dotting 
Eq.(3.55) with v). Also, because all fields are uniform and static the electric field can be 
expressed as E = — V@ where ¢ = — Fy is an electrostatic potential. Since the ion lowers 
its potential energy g@ on moving to larger y, motion in the positive y direction corresponds 
to the ion “falling downhill”. Since the ion starts from rest at y = 0 where ¢ = 0, the total 
energy W = mv?/2 + q¢ is initially zero. Furthermore, the time-independence of the 
fields implies that W must remain zero for all time. Because the kinetic energy mv?/2 
is positive-definite, the ion can only attain finite kinetic energy if it falls downhill, i.e., 
moves into regions of positive y. If for any reason the ion y-coordinate becomes zero at 
some later time, then at such a time the ion would again have to have v = O because 
W = mv? — qEy = 0. 

When the ion begins moving, it will initially experience mainly the electric force qE-y 
because the magnetic force gv x B, being proportional to velocity, is negligible. The 
electric force accelerates the ion in the y direction so the ion develops a positive v, and 
also moves towards larger positive y as it “falls downhill” in the potential. As it develops 
a positive vy, the ion starts to experience a magnetic force quy x BZ = vyqB% which 
accelerates the ion in the positive x—direction causing the ion to develop a positive vz, 
in addition. The trajectory now becomes curved as the ion veers in the x direction while 
moving towards larger y. The positive vz continues to increase and as a consequence a new 
magnetic force qu,ĉx BZ = —v, qBy develops and, being in the negative y direction, this 
increasing magnetic force counteracts the steady electric force, eventually causing the ion 
to decelerate in the y direction. The velocity vy now decreases and ultimately reverses so 
that the ion starts to head in the negative y direction back towards y = 0. As a consequence 
of the reversal of vy, the magnetic force quy%jx BZ will become negative and so the ion 
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will also decelerate in the x-direction. Moving with negative v, means the ion is going 
uphill in the electrostatic potential and when it reaches y = 0, its potential energy must 
go back to zero. As noted above, the ion must come to rest at this point, because its total 
energy is always zero. Because the x velocity was never negative, the result of all this is 
that the ion makes a net positive displacement in the x direction. The whole process then 
repeats with the result that the ion keeps advancing in x while making a sequence of semi- 
circles in which v, oscillates in polarity while v, is never negative. The ion consequently 
moves like a leap-frog which bounces up and down in the y direction while continuously 
advancing in the x direction. If an electron had been used instead of an ion, the sign of 
both the electric and magnetic forces would have reversed and the electron would have 
been confined to regions where y < 0. However, the net displacement would also be in the 
positive x direction (this is easily seen by repeating the above argument using an electron). 


. . . . . . 


electeongy y i O 


. . . . . . . . . 


Figure 3.3: E x B drifts for particles having finite initial energy 


If an ion starts with a finite rather than a zero velocity, it will execute cyclotron (also 
called Larmor) orbits which take the ion into regions of both positive and negative y. How- 
ever, the ion will have a larger gyro-radius in its y > 0 orbit segment than in its y < 0 orbit 
segment resulting again in an average drift to the right as shown in Fig.3.3. Electrons have 
larger gyro-radii in the y < O portions of their orbit, but have a counterclockwise rotation 
so electrons also drift to the right. The magnitude of this steady drift is easily calculated by 
assuming the existence of a constant perpendicular drift velocity in the Lorentz equation, 
and then averaging out the cyclotron motion: 


0=E+(v) xB. (3.56) 
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This may be solved to give the average drift velocity 


ExB 

This steady ‘E cross B’ drift is independent of both the particle’s polarity and initial ve- 
locity. One way of interpreting this behavior is to recall that according to the theory 
of special relativity the electric field E’ observed in a frame moving with velocity u is 
E' = E + u x B and so Eq. (3.56) is simply a statement that a particle drifts in such a 
way to ensure that the electric field seen in its own frame vanishes. The ‘E cross B’ drift 
analysis can be easily generalized to describe the effect on a charged particle of any force 
orthogonal to B by simply making the replacement E — F/q in the Lorentz equation. 
Thus, any spatially uniform, temporally constant force orthogonal to B will cause a drift 


FxB 
VS 


aa (3.58) 


Equations (3.57) and (3.58) lead to two counter-intuitive and important conclusions: 
1. A steady-state electric field perpendicular to a magnetic field does not drive currents 
in a plasma, but instead causes a bulk motion of the entire plasma across the magnetic 
field with the velocity vz. 


2. A steady-state force (e.g., gravity, centrifugal force, etc.) perpendicular to the mag- 
netic field causes oppositely directed motions for electrons and ions and so drives a 
cross-field current 


FxB 
Jr=) no (3.59) 


3.5.2 Drifts in slowly changing arbitrary fields 


We now consider charged particle motion in arbitrarily complicated but slowly changing 
fields subject to the following restrictions: 
1. The time variation is so slow that the fields can be considered as approximately con- 
stant during each cyclotron period of the motion. 


2. The fields vary so gradually in space that they are nearly uniform over the spatial 
extent of any single complete cyclotron orbit. 


3. The electric and magnetic fields are related by Faraday’s law V x E = —0B/0t. 


4. E/B << cso that relativistic effects are unimportant (otherwise there would be a 
problem with vg becoming faster than c). 

In this more general situation a charged particle will gyrate about B, stream parallel to 

B, have ‘E x B’ drifts across B, and may also have force-based drifts. The analysis is based 

on the assumption that all these various motions are well-separated (easily distinguishable 

from each other); this assumption is closely related to the requirement that the fields vary 
slowly and also to the concept of adiabatic invariance. 

The assumed separation of scales is expressed by decomposing the particle motion 

into a fast, oscillatory component — the gyro-motion — and a slow component obtained by 
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averaging out the gyromotion. As sketched in Fig.3.4, the particle’s position and velocity 
are each decomposed into two terms 


x(t) = xel Hr v(t) =H = velt) + ve (6) (3.60) 


where rz (t) ,vz(t) give the fast gyration of the particle in a cyclotron orbit and x,<(t), 
Vgc(t) are the slowly changing motion of the guiding center obtained after averaging out 
the cyclotron motion. Ignoring any time dependence of the fields for now, the magnetic 
field seen by the particle can be written as 


B(x(é)) = B(x,,(t) +rz(¢)) 
= B(x,,(¢))+(rr(t)- V)B. (3.61) 


Because B was assumed to be nearly uniform over the cyclotron orbit, it is sufficient to 
keep only the first term in the Taylor expansion of the magnetic field. The electric field 
may be expanded in a similar fashion. 


particle 
actual 
trajectory 


guiding center trajectory 


x(t 


Figure 3.4: Drift in an arbitrarily complicated field 


After insertion of these Taylor expansions for the non-uniform electric and magnetic 
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fields, the Lorentz equation becomes 


mile +] _ y free, (0) + (ex)-V) 8] 


+4 lVoc(t) + vz (D] x [BCx,.(6)) + (r1 (t) -V) B 


(3.62) 
The gyromotion (i.e., the fast cyclotron motion) is defined to be the solution of the equa- 
tion 

dv L (t) 


m = qvr (t)xB(x,,(¢)); (3.63) 


subtracting this fast motion equation from Eq.(3.62) leaves 


dvgc(t 
mal) = q B(x, .(0) + (rz, (t) - V) E| 
+a Í voelt) x [B,(0) + (rx (0) - V)B] +v) x (L): Y)B}. 
(3.64) 
Let us now average Eq.(3.64) over one gyroperiod in which case terms linear in gyromotion 
average to zero. What remains is an equation describing the slow quantities, namely 
dv g(t) 
MILI = g{B(x,.(t))+Veclt) x B(x, (6)) + (vu(t) x (rz (t)-V)B)} 6-65) 
where () means averaged over a cyclotron period. The guiding center velocity can now be 
decomposed into components perpendicular and parallel to B, 


An 


Vgelt) = Vige(t) + Ujgc(t)B (3.66) 


so that 


~N 


dv,-(t) _ dvi gc(t) d (viselt) B) dv jgc(t) diget) = dB 
dt dt dt dt dt dt 
Denoting the distance along the magnetic field by s, the derivative of the magnetic field 


unit vector can be written, to lowest order, as 


daB aBds 


so Eq.(3.65) becomes 
dvige(t) , Wigelt)a 2 3 Gal _ 
i di B+vj,.B-VB| = qE(x,,(¢)) 
+9vV c(t) x B(x,,(¢)) 
+q (vi(t) x (rz (t) : V) B) - 
(3.69) 
The component of this equation along B is 
dvjjgc(t 
melee) =q |£\(x,.(t)) + (vr) x (ri (t) - V) B), (3.70) 
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while the component perpendicular to B is 


E (x(t 
Tagel) +vjj.B-VB) = q Re acy (3.71) 
+ (v(t) x (rr (t) - V) B) 1 

Equation (3.71) is of the generic form 
aL = F| +qv;: x B (3.72) 

dt 

where 
F, = q E1 (X(t) + (vz (6) x (rz (t)-V)B) 1. a 
mvj, 8: VB. 


Equation (3.72) is solved iteratively based on the assumption that vı gc has a slow time 
dependence. In the first iteration, the time dependence is neglected altogether so that the 
LHS of Eq.(3.72) is set to zero to obtain the ‘first guess’ for the perpendicular drift to be 


F, x B 
qB? ` 


V Lge X VF 


Next, vp is defined to be a correction to this first guess, where vp is assumed small and 
incorporates effects due to any time dependence of Vige. To determine vp, we write 
Vgc = Vr + Vp SO, to second order Eq. (3.72) becomes, 


d(vF + vp) 
m 
dt 


In accordance with the slowness condition, it is assumed that |dvp/dt| << |dvp/dt| so 
Eq.(3.74) becomes 


=F 4 q (VF H vp) x B. (3.74) 


d 
0= —m—F +qvp xB. (3.75) 


Crossing this equation with B gives the general polarization drift 


m dvp 


Vp = 


The most important example of the polarization drift is when vp is the Æ x B drift in a 
uniform, constant magnetic field so that 


md (ExB 
= B 71 
XR ae ( B? ) x Er 
o om dE 
qB? dt ` 


To calculate the middle term on the RHS of Eq.(3.73), it is necessary to average over 
cyclotron orbits (also called gyro-orbits or Larmor orbits). This middle term is defined as 
the ‘grad B’ force 


F a =q (v1 (t) x (rL(t) - V) B). (8.78) 
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To simplify the algebra for the averaging, a local Cartesian coordinate system is used with 
x axis in the direction of the gyrovelocity at t = O and zaxis in the direction of the 
magnetic field at the gyrocenter. Thus, the Larmor orbit velocity has the form 


v(t) = vzro [Ê cos wet — Ñ sin wet] (3.79) 
where A 
ee (3.80) 
m 
is called the cyclotron frequency and the Larmor orbit position has the form 
ry (t) = 2 [sin wet + ĝ coswet]. (3.81) 
We 


Inserting the above two expressions in Eq.(3.78) gives 


Foe =q 2 (jê coswet — sin wet] x ([ê sin wet +9 cosw,t]- V) B). (3.82) 
We 
Noting that (sin?w,t) = (cos*wet) = 1/2 while (sin (wet) cos (wet)) = 0, this reduces 
to 


2 
wo |. 0B .. OB 
F = 
n 2we ex Oy caer 
2B Oy Ox 
2 OB OB, OB, OB, 
= Mio of r, j ĉ ; (3.83) 
2B Oy Ox Oy Ox 
OB OB OB 
But from V - B = 0, it is seen that — 4 == = so the ‘grad B’ force is 
Oy Ox Oz 
2 
mv 
F = Pope ve (3.84) 


where the approximation B, ~ B has been used since the magnetic field direction is mainly 
in the Z direction. 
Let us now define aA ees 

F. = —mvj,-B - VB (3.85) 
and consider this force. Suppose that the magnetic field lines have curvature and consider 
a particular point on a specific field line. Define, as shown in Fig.3.5, a two-dimensional 
cylindrical coordinate system (R, p) with origin at the field line center of curvature for this 
specific point and lying in the plane of the field line at this point. Then, the radial position 
of the chosen point in this cylindrical coordinate system is the local radius of curvature of 
the field line and, since o= B, it is seen that B- VB = ov o= -Ê/ R. Thus, the force 
associated with curvature of a field line 

mo? R 
F. = R (3.86) 

is just the centrifugal force resulting from the motion along the curve of the particle’s 
guiding center. 
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Figure 3.5: Local cylindrical coordinate system defined by curved magnetic field, @ = B. 


The drifts can be summarized as 


Vige=VE+VVB+Ve+vVp 


where 
1. the ‘E cross B’ drift is 
= ExB 
VE = B2 
2. the ‘grad B’ drift is : 
VVB = eee xB 


3. the ‘curvature’ drift is 


qB? qB? 


mv -oo o 1 mv R 
van WE Vp xB ls |B 


4. the ‘polarization’ drift is 


(3.87) 


(3.88) 


(3.89) 


(3.90) 


(3.91) 
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3.5.3 conservation 


We now imagine being in a frame moving with the velocity v1 gc; in this frame the only 
perpendicular velocity is the cyclotron velocity (Larmor motion). Since v | gc is orthogonal 
to B, the parallel equation of motion is not affected by this change of frame and using 
Eqs.(3.70) and (3.84) can be written as 


(3.92) 


where as before, s is the distance along the magnetic field. Multiplication by v gives an 
energy relation 


a [muy mv?o OB 

The perpendicular force defined in Eq.(3.73) does not exist in this moving frame because it 
has been ‘transformed away’ by the change of frames. Also, recall that it was assumed that 
the characteristic scale lengths of E and B are large compared to the gyro radius (Larmor 
radius). However, if the magnetic field has an absolute time derivative, Faraday’s law states 
that there must be an inductive electric field, i.e., an electric field for which $ E-dl Æ 0. 
This is distinct from the static electric field that has been previously assumed and so its 
consequences must be explicitly taken into account. 

To understand the effect of an inductive electric field, consider a specific particle, and 
dot the Lorentz equation with v to obtain 


2 
d f mv; l mu? 
2 2 


ai a) = quy Ey +v -EL (3.94) 
where v] is the vector Larmor orbit velocity. Subtracting Eq.(3.93) from (3.94) gives 


d mv?o mv? 4 OB 
— = -E —. 3.95 
a 2 ) a et oi all Og Ge 


Integration of Faraday’s law over the cross-section of the Larmor orbit gives 


posvxE=- fa E (3.96) 


or 
OB 
dl- E = —ar? — 3.97 
f TE (3.97) 


where it has been assumed that the magnetic field is changing sufficiently slowly for the 
orbit radius to be approximately constant during each orbit. 

Equation (3.95) involves the local electric field E, but Eq.(3.97) only gives the line 
integral of the electric field. This line integral can still be used if Eq.(3.95) is averaged over 
a cyclotron period. The critical term is the time average over the Larmor orbit of gv -E1 
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(which gives the rate at which the perpendicular electric field does work on the particle), 


We 
<v- El Sortit = 5> fa qv, -El 
2T 
Z =e fae (3.98) 
27T 
_ We OB 
2 son: 
The substitution v ; dt = —dl has been used and the minus sign is invoked because particle 


motion is diamagnetic (e.g., ions have a left-handed orbit, whereas in Stokes’ theorem dl 
is assumed to be a right handed line element). Averaging of Eq. (3.95) gives 


d mv? = mv? OB l moio, OB O mv? 4 dB (3.99) 
dt \ 2 ~ 2B a' 2B ðs 2B dt i 


where dB /dt = 0B/Ot + vjðB/ðəs is the total derivative of the average magnetic field 
experienced by the particle over a Larmor orbit. Defining the Larmor orbit kinetic energy 
as W1 = mv? o/2, Eq.(3.99) can be rewritten as 


1 dW, 1dB 
= 1 
W, dt Ba (3.100) 


which has the solution y 

— = p= const. (3.101) 
for magnetic fields that can be changing in both time and space. In plasma physics terminol- 
ogy, pis called the ‘first adiabatic’ invariant, and the invariance of u shows that the ratio of 
the kinetic energy of gyromotion to gyrofrequency is a conserved quantity. The derivation 
assumed the magnetic field changed sufficiently slowly for the instantaneous field strength 
B(t) during an orbit to differ only slightly from the orbit-averaged field strength (B) the 
orbit, i.e., |B(t) — (B) | << (B). 


3.5.4 Relation of : conservation to other conservation relations 


p conservation is both of fundamental importance and a prime example of the adiabatic 
invariance of the action integral associated with a periodic motion. The u conservation 
concept unites together several seemingly disparate points of view: 

1. Conservation of magnetic moment of a particle- According to electromagnetic theory 
the magnetic moment m of a current loop is m = JA where I is the current carried in 
the loop and A is the area enclosed by the loop. Because a gyrating particle traces out 
a circular orbit at the frequency w,/27 and has a charge q, it effectively constitutes 
a current loop having J = qw,/27 and cross-sectional area A = mre, Thus, the 
magnetic moment of the gyrating particle is 


2 
qwe 2 Mio 
ze) N: — l 2 

= Ca ad nn) 
and so the magnetic moment m is an adiabatically conserved quantity. 
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2. Conservation of magnetic flux enclosed by gyro-orbit- Because the magnetic flux ® 
enclosed by the gyro-orbit is 


2 
B= Brr? = (=) u, (3.103) 


p conservation further implies conservation of the magnetic flux enclosed by a gyro- 
orbit. This is consistent with the concept that the magnetic flux is frozen into the 
plasma, since if the field is made stronger, the field lines squeeze together such that 
the density of field lines per area increases proportional to the field strength. As shown 
in Fig.3.6, the particle orbit area contracts in inverse proportion to the field strength so 
that after a compression of field, the particle orbit links the same number of field lines 
as before the compression. 


3. Hamiltonian point of view (cylindrical geometry with azimuthal symmetry)- Define 
a cylindrical coordinate system (r, 0, z) with z axis along the axis of rotation of the 
gyrating particle. Since B, = r~10(rAg)/Or the vector potential is Ag = rB,/2. 
The velocity vector is v =rr + r00 + 22 and the Lagrangian is 


L= m (> 20° 4 2) + gr Ag — qh (3.104) 
so that the canonical angular momentum is 
Pp =mr?04+ qr Ag = mr? + qr? B, /2. (3.105) 
Since particles are diamagnetic, 0 = —w,. Because of the azimuthal symmetry, P9 
will be a constant of the motion and so 
const. = P) = —mr?we + qr? B /2 = ie = aria (3.106) 


This shows that constancy of canonical angular momentum is equivalent to u conser- 
vation. It is important to realize that constancy of angular momentum due to perfect 
axisymmetry is a much more restrictive assumption than the slowness assumption 
used for adiabatic invariance. 


4. Adiabatic gas law- The pressure associated with gyrating particles has dimensionality 
N = 2,i.e., P = (m/2) fv v' fd?v where v’ =v,ê + v Ñ and the x — y plane is 
the plane of the gyration. Also the density for a two dimensional system has units of 
particles/area, i.e. n ~ 1/A. Hence, the pressure will scale as P ~ v2, /A. Since 
y = (N + 2)/N = 2, the adiabatic law, Eq.(2.37), gives 


P 2 
const. ~ 55 ~ ALA (8.107) 


but from the flux conservation property of orbits A ~ 1/B so Eq.(3.107) becomes 


(3.108) 


which is again proportional to x since v, is proportional to the mean perpendicular 


thermal energy, i.e., the average of the gyrational energies of the individual particles 
making up the fluid. 
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Figure 3.6: Illustration showing how conservation of flux linked by an orbit is equivalent to 
frozen-in field; also increasing magnetic field results in magnetic compression. 


3.5.5 Magnetic mirrors- a consequence of u conservation 


Consider a charged particle moving in a static, but spatially nonuniform magnetic field. 
The non-uniformity is such that the field strength varies in the direction of the field line so 
that 0B/Os + 0 where s is the distance along a field line. Such a field cannot be straight 
because if it were and so had the form B =B,(z)z, it would necessarily have a non-zero 
divergence, i.e., it would have V - B = 0B,/0z 4 0. Because magnetic fields must have 
zero divergence there must be another component besides B, and this other component 
must be spatially non-uniform also in order to contribute to the divergence. Hence the field 
must be curved if the field strength varies along the direction of the field. 

This curvature is easy to see by sketching field lines, as shown in Fig.3.7. The density 
of field lines is proportional to the strength of the magnetic field and so a gradient of field 
strength along the field means that the field lines squeeze together as the field becomes 
stronger. Because magnetic field lines have zero divergence they are endless and so must 
bend as they squeeze together. This means that if 0B, /Oz Æ 0 there must also be a field 
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transverse to the initial direction of the magnetic field, i.e., a field in the x or y directions. 
In a cylindrically symmetric system, this transverse field must be a radial field as indicated 
by the vector decomposition B =B,z + B,?r in Fig.3.7. 


field lines squeezed 
together 


Figure 3.7: Field lines squeezing together when B has a gradient. B field is stronger on the 
right than on the left because density of field lines is larger on the right. 


The magnetic field is assumed to be static so that V x E = 0 in which case E = — Vo 
and Eq.(3.92) can be written as 


dv Ob OB 
m =-q 


dt ðs as" Gi) 
Multiplying Eq.(3.109) by v gives 
d muy 
ulo tqo +uB| =0, (3.110) 


assuming that the electrostatic potential is also constant in time. Time integration gives 


mvi 
2 


Thus, uB (s) acts as an effective potential energy since it adds to the electrostatic potential 
energy g@(s). This property has the consequence that if B(s) has a minimum with respect 
to s as shown in Fig.3.8, then uB acts as an effective potential well which can trap particles. 
A magnetic trap of this sort can be produced by two axially separated coaxial coils. On each 
field line B(s) has at locations sı and s2 maxima near the coils, a minimum at location so 


+ qels) + B(s) = const. (3.111) 
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between the coils, and B(s) tends to zero as s — too. To focus attention on magnetic 
trapping, suppose now that no electrostatic potential exists so Eq.(3.111) reduces to 


mvi 


or + uB(s) = const. (3.112) 


Now consider a particle with parallel velocity vo located at the well minimum so at time 
t = 0. Evaluating Eq.(3.112) at s = 0,¢ = O and then again when the particle is at some 
arbitrary position s gives 


mu7(s) MUjo m (03, a vo) 
TORE + nB(s) = 1° + B59) = — 


where Wo is the particle’s total kinetic energy att = 0. Solving Eq.(3.113) for vj (s) gives 


vy(s) = £4 [= [Wo — uB(s)]. (3.114) 


If uB(s) = Wo at some position s, then v (s) must vanish at this position in which case 
the particle must reverse its direction of motion just like a pendulum reversing direction 
when its velocity goes through zero. This velocity reversal corresponds to a reflection of 
the particle and so this configuration is called a magnetic mirror. A particle can be trapped 
between two magnetic mirrors; such a configuration is called a magnetic trap or a magnetic 
well. 


= Wo (3.113) 


“potential” “potential” 
B hill hill 


/ “potential” 
valley 


Figure 3.8: Magnetic mirror 


If Wo > Bmax Where Bmax is the magnitude at sı 2 then the velocity does not go to 
zero at the maximum amplitude of the mirror field. In this case the particle does not reflect, 
but instead escapes over the peak of the uB(s) potential hill and travels out to infinity. 
Thus, there are two classes of particles: 
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1. trapped particles — these have Wo < uBmax and bounce back and forth between the 
mirrors of the magnetic well, 


2. untrapped (or passing) particles — these have Wo > uBmax and are retarded at the 
potential hills but not reflected. 
Since u = mv? 9/2Bmin and Wo = mvé/2, the criterion for trapping can be written 
as 
Brin < Vio (3.115) 
Bmax v 
Let us define @ as the angle the velocity vector makes with respect to the magnetic field at 
So, i.e., sin? = v1 o /vo and also define 


1 Bmin 
Boaz 


(3.116) 


Otrap = sin ~ 


Thus, as shown in Fig.3.9 all particles with 0 > trap are trapped, while all particles with 
0 < Oirap are untrapped. Suppose at t = O the particle velocity distribution at sọ is 
isotropic. After a long time interval long enough for all untrapped particles to have escaped 
the trap, there will be no particles in the 0 < Orap region of velocity space. The velocity 
distribution will thus be zero for 0 < trap; such a distribution function is called a /oss-cone 
distribution function. 


loss mirror 
cone trapped 


Oira 


Figure 3.9: Loss-cone velocity distribution. Particles with velocity angle 0 > Qirap are 
mirror trapped, others are lost. 


3.5.6 J, the Second Adiabatic Invariant 


Trapped particles have periodic motion in the magnetic well, and so applying the concept 
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of adiabatic invariance presented in Sec.3.4.1, the quantity 


L= $ Pias = f (rm + qAj)ds (3.117) 


will be an invariant if 
1. any time dependence of the well shape is slow compared to the bounce frequency of 
the trapped particle, 


2. any spatial inhomogeneities of the well magnetic field are so gradual that the particle’s 
bounce trajectory changes by only a small amount from one bounce to the next. 
To determine the circumstances where A) # 0, we use Coulomb gauge (i.e., assume 
V - A =0) and at any given location define a local Cartesian coordinate system with z axis 
parallel to the local field. From Ampere’s law it is seen that 


[V x (V XA) pS —V?A; = wJ: (3.118) 


so A, is finite only if there is a current parallel to the magnetic field. Because J, acts as 
the source term in a Poisson-like partial differential equation for A,, the parallel current 
need not be at the same location as A,. If there are no currents parallel to the magnetic 
field anywhere then A, = 0, and in this case the second adiabatic invariant reduces to 


J =m f yas, (3.119) 


Having a current flow along the magnetic field corresponds to a more complicated magnetic 
topology. The axial current produces an associated azimuthal magnetic field which links 
the axial magnetic field resulting in a helical twist. This more complicated situation of 
finite magnetic helicity will be discussed in a later chapter. 


3.5.7 Consequences of J -invariance 


Just as u invariance was related to the perpendicular CGL adiabatic invariant discussed 
in Sec.(2.101), J -invariance is closely related to the parallel CGL adiabatic invariant also 
discussed in Sec.(2.101). To see this relation, recall that density in a one dimensional 
system has dimensions of particles per unit length, i.e., 215 ~ 1/L, and pressure in a one 
dimensional system has dimensions of kinetic energy per unit length, i.e., Pip ~ vi /L. 
For parallel motion the number of dimensions is N = 1 so that y = (N + 2)/N = 3and 
the fluid adiabatic relation is 


~ (vL)? (3.120) 


2 
Pip vy /L 
const. ~ =a 


err 


which is a simplified form of Eq.(3.119) since Eq.(3.119) has the scaling 7 ~w L = const. 

J -invariance combined with mirror trapping/detrapping is the basis of an acceleration 
mechanism proposed by Fermi (1954) as a means for accelerating cosmic ray particles to 
ultra-relativistic velocities. The Fermi mechanism works as follows: Consider a particle 
initially trapped in a magnetic mirror. This particle has an initial angle in velocity space 
0 > Oirap; both 0 and Otrap are measured when the particle is at the mirror minimum. Now 
suppose the distance between the magnetic mirrors is slowly reduced so that the bounce 
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distance L of the mirror-trapped particle slowly decreases. This would typically occur by 
reducing the axial separation between the coils producing the magnetic mirror field. Be- 
cause J ~ v)L is invariant, the particle’s parallel velocity increases on each successive 
bounce as L slowly decreases. This steady increase in v means that the velocity angle 
0 decreases. Eventually, 0 becomes smaller than 6;,,, whereupon the particle becomes 
detrapped and escapes from one end of the mirror with a large parallel velocity. This mech- 
anism provides a slow pumping to very high energy, followed by a sudden and automatic 
ejection of the energetic particle. 


3.5.8 The third adiabatic invariant 


Consider a particle bouncing back and forth in either of the two geometries shown in 
Fig.3.10. In Fig.3.10(a), the magnetic field is produced by a single magnetic dipole and 
the field lines always have convex curvature, i.e. the radius of curvature is always on the 
inside of the field lines. The field decreases in magnitude with increasing distance from the 
dipole. 


(a) (b) 


Figure 3.10: Magnetic field lines relevant to discussion of third adiabatic invariant: (a) field 
lines always have same curvature (dipole field), (b) field lines have both concave and con- 
vex curvature (mirror field). 


In Fig.3.10(b) the field is produced by two coils and has convex curvature near the 
mirror minimum and concave curvature in the vicinity of the coils. On defining a cylindrical 
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coordinate system (r,9,z) with z axis coaxial with the coils, it is seen that in the region 
between the two coils where the field bulges out, the field strength is a decreasing function 
of r, i.e. OB/Or < 0, whereas in the plane of each coil the opposite is true. Thus, in the 
mirror minimum, both the centrifugal and grad B forces are radially outward, whereas the 
opposite is true near the coils. 

In both Figs. 3.10(a) and (b) a particle moving along the field line can be mirror- 
trapped because in both cases the field has a minimum flanked by two maxima. However, 
for Fig.3.10(a), the particle will have grad B and curvature drifts always in the same az- 
imuthal sense, whereas for Fig.3.10(b) the azimuthal direction of these drifts will depend 
on whether the particle is in a region of concave or convex curvature. Thus, in addition to 
the mirror bouncing motion, much slower curvature and grad B drifts also exist, directed 
along the field binormal (i.e. the direction orthogonal to both the field and its radius of 
curvature). These higher-order drifts may alternate sign during the mirror bouncing. The 
binormally directed displacement made by a particle during its it” complete period 7 of 


mirror bouncing is 
T 


ôrj= | vdt (3.121) 


where 7 is the mirror bounce period and v is the sum of the curvature and grad B drifts 
experienced in the course of a mirror bounce. This displacement is due to the cumulative 
effect of the curvature and grad B drifts experienced during one complete period of bounc- 
ing between the magnetic mirrors. The average velocity associated with this slow drifting 
may be defined as 


T 


1 
(v) =- vdt. (3.122) 
T Jo 
Let us calculate the action associated with a sequence of ôr;, This action is 


S= So [m(v) + 4A], - ôr; (3.123) 
j 


where the quantity in square brackets is evaluated on the line segment ôr;. If the ôr; are 
small then this can be converted into an action ‘integral’ for the path traced out by the 
ôr;. If the ôr; are sufficiently small to behave as differentials, then we may write them as 
dV pounce and express the summation as an action integral 


S= I [m (v) + qA] - drbounce (3.124) 


where it must be remembered that (v) is the bounce-averaged velocity. The quantity 
m (v} + qA is just the canonical momentum associated with the effective motion along 
the sequence of line segments dr; . The vector rpounce is a vector pointing from the origin 
to the particle’s location at successive bounces and so is the generalized coordinate asso- 
ciated with the bounce averaged velocity. If the motion resulting from (v) is periodic, we 
expect © to be an adiabatic invariant. The first term in Eq.(3.124) will be of the order of 
Mari fi2T7r where r is the radius of the trajectory described by the dr;. The second term is 
just q® where ® is the magnetic flux enclosed by the trajectory. Let us compare the ratio 
of these two terms 

fJ miv) -dr  mvarige2ar _ Vdrift r? 


~ 


fqA-dr qBrr2 Wer r2 


(3.125) 
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where we have used vy g ~ Ve ~ v? [wer ~ wer? /r. Thus, if the Larmor radius is much 
smaller than the characteristic scale length of the field, the magnetic flux term dominates 
the action integral and adiabatic invariance corresponds to the particle staying on a constant 
flux surface as its orbit evolves following the various curvature and grad B drifts. This third 
adiabatic invariant is much more fragile than J, which in turn was more fragile than u, 
because the analysis here is based on the rather strong assumption that the curvature and 
grad B drifts are small enough for the ôr; to trace out a nearly periodic orbit. 


3.6 Relation of Drift Equations to the Double Adiabatic 
MHD Equations 


The derivation of the MHD Ohm’s law involved dropping the Hall term (see p. 48) and 
the basis for dropping this term was assuming that w << wei where w is the characteris- 
tic rate of change of the electromagnetic field. The derivation of the single particle drift 
equations involved essentially the same assumption (i.e., the motion was slow compared to 
Weg). Thus, if the characteristic rate of change of the electromagnetic field is slow com- 
pared to wei both the MHD and the single particle drift equations ought to be equally valid 
descriptions of the plasma dynamics. If so, then there also ought to be some sort of a cor- 
respondence relation between these two points of view. Some evidence supporting this 
hypothesis was the observation that the single particle adiabatic invariants u and J were 
respectively related to the perpendicular and parallel double adiabatic MHD equations. It 
thus seems reasonable to expect additional connections between the drift equations and 
the double adiabatic MHD equations. 

In fact, an approximate derivation of the double adiabatic MHD equations can be ob- 
tained by summing the currents associated with the various particle drifts — providing one 
additional effect, diamagnetic current, is added to this sum. Diamagnetic current is a pe- 
culiar concept because it is a consequence of the macroscopic phenomenon of pressure 
gradients and so has no meaning in the context of a single particle description. 

In order to establish this microscopic-macroscopic relationship we begin by recalling 
from electromagnetic theory” that a magnetic material with density M of magnetic dipole 
moments per unit volume has an associated magnetization current 


Ju =VXM. (3.126) 


The magnitude of the magnetic moment of a charged particle in a magnetic field was shown 
in Sec.3.5.4 to be u. The magnetic moment of a magnetic dipole is a vector pointing in 
the direction of the magnetic field produced by the dipole. The vector magnetic moment 
of a charged particle gyrating in a magnetic field is m = —j.B where the minus sign 
corresponds to cyclotron motion being diamagnetic, i.e., the magnetic field resulting from 
cyclotron rotation opposes the original field in which the particle is rotating. For example, 
an individual ion placed in a magnetic field B =B2 rotates in the negative 0 direction, and 
so the current associated with the ion motion creates a magnetic field pointing in the —Z 
direction inside the ion orbit. 


?For example, see p. 192 of (Jackson 1998). 
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Suppose there exists a large number or ensemble of particles with density n, and mean 
magnetic moment jz. The density of magnetic moments, or magnetization density, of this 
ensemble is 


3 3 
_ ee Movi NA P, B 

M=- > Noh B = — > ne ( 2B )B=- B (3.127) 

where () denotes averaging over the velocity distribution and Eq.(2.26) has been used. 


Inserting Eq.(3.127) into Eq.(3.126) shows that this ensemble of charged particles in a 
magnetic field has a diamagnetic current 
P\B 
—— ]. 3.128 
(=>) 6129) 
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Figure 3.11: Gradient of magnetized particles gives apparent current as observed on dashed 
line. 


Figure 3.11 shows the physical origin of J q. Here, a collection of ions all rotate clock- 
wise in a magnetic field pointing out of the page. The azimuthally directed current on the 
dashed curve is the sum of contributions from (i) particles with guiding centers located 
one Larmor radius inside the dashed curve and (ii) particles with guiding centers located 
one Larmor radius outside the dashed curve. From the point of view of an observer lo- 
cated on the dashed curve, the inside particles [group (1)] constitute a clockwise current, 
whereas the outside [group (11)] particles constitute a counterclockwise current. If there are 
unequal numbers of inside and outside particles (indicated here by concentric circles inside 
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the dashed curve), then the two opposing currents do not cancel and a net macroscopic cur- 
rent appears to flow around the dashed curve, even though no actual particles flow around 
the dashed curve. Inequality of the numbers of inside and outside particles corresponds to 
a density gradient and so we see that a radial density gradient of gyrating particles gives a 
net macroscopic azimuthal current. Similarly, if there is a radial temperature gradient, the 
velocities of the inner and outer groups differ, resulting again in an apparent macroscopic 
azimuthal current. The combination of density and temperature gradients is such that the 
net macroscopic current depends on the pressure gradient as given by Eq.(3.128). 

Taking diamagnetic current into account is critical for establishing a correspondence 
between the single particle drifts and the MHD equations, and having recognized this, 
we are now in a position to derive this correspondence. In order for the derivation to be 
tractable yet non-trivial, it will be assumed that the magnetic field is time-independent, but 
the electric field will be allowed to depend on time. It is also assumed that the dominant 
cross-field particle motion is the vg = E x B/B? drift; this assumption is consistent with 
the hierarchy of particle drifts (i.e., polarization drift is a higher-order correction to vg). 

Because both species have the same v g, no macroscopic current results from vg, and 
so all cross-field currents must result from the other, smaller drifts, namely vy g, Ve, and 
Vp- Let us now add the magnetization current to the currents associated with vy g, Ve, and 
Vp to obtain the total macroscopic current 


Jiotaa = Jm +Jvg + Je +Jp = Jm + X noqo (Uv B, + Uco + Upo) (3.129) 


o 


where Jy g, Je, Jp are currents due to grad B, curvature, and polarization drifts respec- 
tively and uy go, Uc,¢ and up, o are the mean (i.e., fluid) velocities associated with these 
drifts. These currents are explicitly: 

1. grad B current 


Sve = `}, NnoqloUyB, o 
= ay Mettete (vi,) VBxB _ VBxB (3.130) 
= g 2B qB? ~ + B 
2. curvature current 
J. = oF Ne Ia Uc, o 
B-YBxB B-VBxB (3.131) 
SoS ae No dao Mo (v2) qo B? = P B2 
3. polarization current 
_ = Mo dE, = P dE, 
J, = 9 Modern. = 2 neto (5 7 ) Sa F (3.132) 


Because the magnetic field was assumed to be constant, the time derivative of vg is the 
only contributor to the polarization drift current. 
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The total magnetic force is 


Jota XB = (Jm +Jvg+J:+Jp)xB 
P, B VB xB 
vx| 5 ] -r (3.133) 
= \ s x B. 
B-VBxB p dE 
|B? "B? at 
The grad B current cancels part of the magnetization current as follows: 
P, B VB xB 1 ae Ie : 
v HP = |v(5)xPLÊ+=Vx (PĒ) 
x ( B L B3 | (a x PL b+ B x dik | 
VBxB 
at apes 
1 3 P ~ VPEXB 
a SVX (PB) = epee oes 
B B B 
(3.134) 
so that 
ts A z > ~ pqdE ^ 
Jiotal x B=- P\VxB+VPL xB+AB-VBxB- p x B. (3.135) 
The first term can be recast using the vector identity 
B-B x PEE z 
v (5) -052 v+8xvxA (3.136) 
while the electric field can be replaced using E = —U x B to give 
> > pda(UxB) .- 
Jota X B = — (P, — Pi) B -VB +V 1P, - = —— xB. .137 
total X (Pr= Py SV GE ae (3.137) 
Here the relation |B- VB Pa Ê - VB has been used; this relation follows from Eq. 
(3.136). Finally, it is observed that 
lv: (88)] = (vV-8)B+B-vB] =B-VB (3.138) 


sO 


(P. - P) -VÊ =(P.-7)|V-(88)) = |V: {(PL- P) BB}. 6139 
Furthermore, 
pd(UxB) . dU 
ule Bun |p 
Bod ” Pai 
since it has been assumed that the magnetic field is time-independent. Inserting these last 
two results in Eq.(3.137) gives 


(3.140) 


dU 
Joni X B= lv. Ta - Pj) Ê X +V, P, + a (3.141) 
L dt |, 
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or 

a = [Feotat xB-V- {P T+ (Pi — P)) BB\ 7 (3.142) 
which is just the n component of the double adiabatic MHD equation of mo- 
tion. This demonstrates that if diamagnetic current is taken into account, the drift equations 
for phenomena with characteristic frequencies w much smaller than we; and the double adi- 
abatic MHD equations are equivalent descriptions of plasma dynamics. This analysis also 
shows that one has to be extremely careful when invoking single particle concepts to ex- 
plain macroscopic behavior, because if diamagnetic effects are omitted, erroneous conclu- 
sions can result. 

The reason for the name polarization current can now be addressed by comparing this 
current to the current flowing through a parallel plate capacitor with dielectric e. The ca- 
pacitance of the parallel plate capacitor is C = £ A/d where A is the cross-sectional area 
of the capacitor plates and w is the gap between the plates. The charge on the capacitor 
is Q = CV where V is the voltage across the capacitor plates. The current through the 
capacitor is I = dQ/dt so 
cAdV 


dV 
[=C—= 14 
di (3.143) 


d dt 
However the electric field between the plates is # = V/d and the current density is J = 


I/A so this can be expressed as 
dE 
= eae (3.144) 
which gives the alternating current density in a medium with dielectric e. If this is compared 


to the polarization current 


dE 
Jp = aig (3.145) 
it is seen that the plasma acts like a dielectric medium in the direction perpendicular to the 
magnetic field and has an effective dielectric constant given by p/B?. 
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Adiabatic behavior occurs when temporal or spatial changes in the electromagnetic field 
from one cyclical orbit to the next are sufficiently gradual to be effectively continuous 
and differentiable (i.e., analytic). Thus, adiabatic behavior corresponds to situations where 
variations of the electromagnetic field are sufficiently gradual to be characterized by the 
techniques of calculus (differentials, limits, Taylor expansions, etc.). 

Non-adiabatic particle motion occurs when this is not so. It is therefore no surprise that 
it is usually not possible to construct analytic descriptions of non-adiabatic particle motion. 
However, there exist certain special situations where non-adiabatic motion can be described 
analytically. Using these special cases as a guide, it is possible to develop an understanding 
for what happens when motion is non-adiabatic. 

One special situation is where the electromagnetic field is geometrically symmetric with 
respect to some coordinate Q; in which case the symmetry makes it possible to develop 
analytic descriptions of non-adiabatic motion. This is because symmetry in Q; causes the 
canonical momentum P; to be an exact constant of the motion. The critical feature is that 
Pj remains constant no matter how drastically the field changes in time or space because 
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Lagrange’s equation P; = —0OL/0Q, has no limitations on the rate at which changes can 
occur. In effect, being geometrically symmetric trumps being non-analytic. The absolute 
invariance of P; when OL/0Q,; = 0 reduces the number of equations and allows a partial 
or sometimes even a complete solution of the motion. Solutions to symmetric problems 
give valuable insight regarding the more general situation of being both non-adiabatic and 
asymmetric. 

Two closed related examples of non-adiabatic particle motion will now be analyzed: (i) 
sudden temporal and (ii) sudden spatial reversal of the polarity of an azimuthally symmetric 
magnetic field having no azimuthal component. The most general form of such a field can 
be written in cylindrical coordinates (r, 0, z) as 


B= u(r, zt) x V6; (3.146) 


a field of this form is called poloidal. Rather than using 0 explicitly, the form V@ has been 
used because Vô is better suited for use with the various identities of vector calculus (e.g., 
VxV06 = 0) and leads to greater algebraic clarity. The relationship between V0 and 0 is 
seen by simply taking the gradient: 


v= (a En ag ont (3.147) 


Equation(3.146) automatically satisfies V - B = 0 [by virtue of the vector identity 
V - (G x H) =H-VxG-— G-V xH], has no 0 component, and is otherwise arbitrary 
since 7 is arbitrary. As shown in Fig.3.12, the magnetic flux linking a circle of radius r 
with center at axial position z is 


á 1 
f Bas = i Qardrz - Fawn x VO 
T 


0 
(3.148) 
r oy t 
= / dr aU = U(r, z,t) — (0, z,t). 
0 Or 
However, 
1 Ow 

B,(r, z,t) = -——— — 14 

(a 2mr Oz Oe) 


and since V - B =0, B, must vanish at r = 0, and so Ow /0z = 0 on the symmetry axis 
r=0. 
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Figure 3.12: Azimuthally symmetric flux surface 


Thus Yis constant along the symmetry axis r = 0; for convenience we choose this 
constant to be zero. Hence, y(r, z,t) is precisely the magnetic flux enclosed by a circle of 
radius r at axial location z. We can also use the vector potential A to calculate the magnetic 
flux through the same circle and obtain 


27 
[Bs = [Praos = faa = i Agrdé = 2rr Ag. (3.150) 
0 


This shows that the flux ~ and the vector potential Ag are related by 


W(r, z,t) = 2rr Ao. (3.151) 


No other component of vector potential is required to determine the magnetic field and so 


we may set A =Ag(r, z, t)0. 
The current J =o 1Y x B producing this magnetic field is purely azimuthal as can be 
seen by considering the r and z components of V x B. The actual current density is 


Jo = Uy 'rVO-VxB 
= po'rV-(BxV6) 


r 1 
— = V 4 —V i 
2T uo € v) 


r [a (1d). 18y 
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a Poisson-like equation. Since no current loops can exist at infinity, the field prescribed 
by Eq.(3.146) must be produced by a set of coaxial coils having various finite radii r and 
various finite axial positions z. 

The axial magnetic field is 


1 Ow 
B EE 3.153 
2rr Or ( ) 
Near r = 0, w can always be Taylor expanded as 
y(r =0 2 u(r =0 
w(r,z) =O+4r EO) at OG (3.154) 


Or 2 ðr? 


Suppose that 07) /Or is non-zero at r = 0, i.e., YW ~ r near r = 0. If this were the case, 
then the first term in the right hand side of the last line of Eq.(3.152) would become infinite 
and so lead to an infinite current density at r = 0. Such a result is non-physical and so we 
require that the first non-zero term in the Taylor expansion of 7 about r = 0 to be the r? 
term. 

Every field line that loops through the inside of a current loop also loops back in the 
reverse direction on the outside, so there is no net magnetic flux at infinity. This means 
that % must vanish at infinity and so as r increases, 7 increases from its value of zero at 
r = 0 to some maximum value Ymax at r = Tmax, and then slowly decreases back to zero 
as r — oo. As seen from Eq.(3.153) this behavior corresponds to B, being positive for 
r < Tmax and negative for r > fmax- A contour plot of the y(r, z) flux surfaces and a plot 
of y(r, z = 0) versus r is shown in Fig.3.13. 
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Figure 3.13: Contour plot of flux surfaces 
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In this cylindrical coordinate system the Lagrangian, Eq.(3.12), has the form 


ia “ (i? 26” 4 2) | grb Ao — ad(r, z, t). (3.155) 


Since @ is an ignorable coordinate, the canonical angular momentum is a constant of the 
motion, i.e. 


OL . 
Pj= an = mr? + qrAg = const. (3.156) 
or, in terms of flux, 
Py = mr?6 + yr, z, t) = const. (3.157) 
Thus, the Hamiltonian is 
H = z a 126” 4 A tolr, z,t) 
2 


= T (2 2?) x(r,z,t) 


where 


1 | Po —qu(r, z,t)/27 ? 
2m r 


x(r, z,t) = (3.159) 


is an effective potential. For purposes of plotting, the effective potential can be written in a 
dimensionless form as 


27 Po V(r, z,t) ; 


x(r, Z, t) qavo Wo 
(3.160) 
Xo r/L 


where L is some reference scale length, Yo is some arbitrary reference value for the flux, 
and Xo = qwo/8x?L?m. For simplicity we have set ¢(r, z,t) = 0, since this term gives 
the motion of a particle in a readily understood, two-dimensional electrostatic potential. 

Suppose that for times ¢ < t; the coil currents are constant in which case the associated 
magnetic field and flux are also constant. Since the Lagrangian does not explicitly depend 
on time, the energy H is a constant of the motion. Hence there are two constants of the 
motion, H and Ps. Consider now a particle located initially on the midplane z = O with 
r < Tmax- The particle motion depends on the sign of qù /Pp and so we consider each 
polarity separately. 
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Figure 3.14: Specific example (with z dependence suppressed) showing w and x rela- 
tionship: top is plot of function Y(r)/Yo = (r/L)*/(1 + (r/L)°), middle and bot- 
tom plots show corresponding normalized effective potential for 2r P/qYoọ = +0.2 and 
27 Pa/qWo = —0.2. Both middle and bottom plots have a minimum at r/L ~ 0.45; mid- 
dle plot also has a minimum at r/L ~ 1.4. The two minima in the middle plot occur when 
X(r)/Xo = 0 but the single minimum in the bottom plot occurs at a finite value of x(r)/Xo 
indicating that an axis-encircling particle must have finite energy. 


1. qw/Po is positive. If 27| P| < |Y max] there exists a location inside fmax where 
P) = Ly (3.161) 


and there exists a location outside fmax Where this equality holds as well. x vanishes 
at these two points which are also local minima of x because x is positive-definite. 
The top plot in Fig.3.14 shows a nominal W(r)/wWo flux profile and the middle plot 
shows the corresponding x(r)/Xo ; the z and t dependence are suppressed from the 
arguments for clarity. There exists a maximum of x between the two minima. We 
consider a particle initially located in one of the two minima of x. If H < Xmax the 
particle will be confined to an effective potential well centered about the flux surface 
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defined by Eq.(3.161). From Eq.(3.157) the angular velocity is 


jer (r = w) f (3.162) 


mr? 2T 


The sign of Å reverses periodically as the particle bounces back and forth in the x 
potential well. This corresponds to localized gyromotion as shown in Fig.3.15. 


non-axis-encircling axis-encircling 


OC 


Figure 3.15: Localized gyro motion associated with particle bouncing in effective potential 
well. 


2. qw/Po is negative. In this case x can never vanish, because Py — qw/2m never 
vanishes. Nevertheless, it is still possible for x to have a minimum and hence a 
potential well. This possibility can be seen by setting 0x/Or = O which occurs 


when aa ao 
(Po 1 V) ea LLL) (3.163) 
217" / Or r 
Equation (3.163) can be satisfied by having 
4; 
Sif Le oe 
— | —— | =0 (3.164) 
Or r 
which implies 
qrt O (fy 
-=—— |- l 
2T a(g J (ETES) 
2 


Recall that 7 had a maximum, that Y% ~ r^ near r = O, and also that ~ — O as 
r — oo. Thus w/r ~ r for small r and Y/r — 0 for r — oo so that ~/r also 
has a maximum; this maximum is located at an r somewhat inside of the maximum 
of y. Thus Eq.(3.165) can only be valid at points inside of this maximum; other- 
wise the assumption of opposite signs for Pg and 7 would be incorrect. Furthermore 


102 Chapter 3. Motion of a single plasma particle 


Eq.(3.165) can only be satisfied if | Pg | is not too large, because the right hand side of 
Eq.(3.165) has a maximum value. If all these conditions are satisfied, then x will have 
a non-zero minimum as shown in the bottom plot of Fig.3.14. 
A particularly simple example of this behavior occurs if Eq.(3.165) is satisfied near the 
r = Oaxis (i.e., where Y ~ r?) so that this equation becomes simply 
peasy (3.166) 
27 
which is just the opposite of Eq.(3.161). Substituting in Eq.(3.162) we see that 0 now 
never changes sign; i.e., the particle is axis-encircling. The Larmor radius of this axis- 
encircling particle is just the radius of the minimum of the potential well, the radius where 
Eq.(3.165) holds. The azimuthal kinetic energy of the particle corresponds to the height of 
the minimum of x in the bottom plot of Fig.3.14. 


3.7.1 Temporal Reversal of Magnetic Field - Energy Gain 


Armed with this information about axis-encircling and non-axis encircling particles, we 
now examine the strongly non-adiabatic situation where a coil current starts at J = Ip, 
is reduced to zero, and then becomes J = —Ip, so that all fields and fluxes reverse sign. 
The particle energy will not stay constant for this situation because the Lagrangian depends 
explicitly on time. However, since symmetry is maintained, Pg must remain constant. Thus, 
a non-axis encircling particle (with radial location determined by Eq. (3.161)) will change 
to an axis-encircling particle if a minimum exists for x when the sign of w is reversed. If 
such a minimum does exist and if the initial radius was near the axis where y ~ r°, then 
comparison of Eqs.(3.161) and (3.166) shows that the particle will have the same radius 
after the change of sign as before. The particle will gain energy during the field reversal by 
an amount corresponding to the finite value of the minimum of x for the axis-encircling 
case. 

This process can also be considered from the point of view of particle drifts: Initially, 
the non-axis-encircling particle is frozen to a constant 7 surface (flux surface). When the 
coil current starts to decrease, the maximum value of the flux correspondingly decreases. 
The constant ~ contours on the inside of Ymax move outwards towards the location of 
Wimax Where they are annihilated. Likewise, the contours outside of Ymax move inwards 
to Ymax Where they are also annihilated. 

To the extent that the E x B drift is a valid approximation, its effect is to keep the 
particle attached to a surface of constant flux. This can be seen by integrating Faraday’s 
law over the area of a circle of radius r to obtain f ds - V x E = — f ds - 0B/0t and then 
invoking Stoke’s theorem to give 

ow 


Hoar = — =. (3.167) 


The theta component of E+ v x B = Ois 

Eo +v,B, —v,B, =0 (3.168) 
and from (3.146), B, = — (2rr) t ôy/ðz and B, = — (2nr)~* OW/Or. Combination of 
Eqs.(3.167) and (3.168) thus gives 


o0. (3.169) 
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Because w(r(t), z(t), t) is the flux measured in the frame of a particle moving with trajec- 
tory r(t) and z(t), Eq.(3.169) shows that the E x B drift maintains the particle on a surface 
of constant %, i.e., the E x B drift is such as to maintain di/dt = O where d/dt means 
time derivative as measured in the particle frame. 

The implication of this attachment of the particle to a surface of constant ~ can be 
appreciated by making an analogy to the motion of people initially located on the beach 
of a volcanic island which is slowly sinking into the sea. In order to avoid being drowned 
as the island sinks, the people will move towards the mountain top to stay at a constant 
height above the sea. The location of Ymax here corresponds to the mountain top and the 
particles trying to stay on surfaces of constant ~ correspond to people trying to stay at 
constant altitude. A particle initially located at some location away from the “mountain 
top” Ymax Moves towards Ymax if the overall level of all the 7 surfaces is sinking. The 
reduction of ~ as measured at a fixed position will create the azimuthal electric field given 
by Eq.(3.167) and this electric field will, as shown by Eqs.(3.168) and (3.169), cause an 
E x B drift which convects each particle in just such a way as to stay on a constant % 
contour. 

The E x B drift approximation breaks down when B becomes zero, i.e., when Y% changes 
polarity. This breakdown corresponds to a breakdown of the adiabatic approximation. If 
w changes polarity before a particle reaches Ymax» the particle becomes axis-encircling. 
The extra energy associated with being axis-encircling is obtained when Y ~ 0 but 0¢/0t # 
O so that there is an electric field Eg, but no magnetic field. Finite Fg and no magnetic 
field results in a simple theta acceleration of the particle. Thus, when w reverses polarity the 
particle is accelerated azimuthally and develops finite kinetic energy. After 7) has changed 
polarity the magnitude of ~ increases and the adiabatic approximation again becomes valid. 
Because the polarity is reversed, increase of the magnitude of ~ is now analogous to creat- 
ing an ever deepening crater. Particles again try to stay on constant flux surfaces as dictated 
by Eq.(3.169) and as the crater deepens, the particles have to move away from Vmin to 
stay at the same altitude. When the reversed flux attains the same magnitude as the origi- 
nal flux, the flux surfaces have the same shape as before. However, the particles are now 
axis-encircling and have the extra kinetic energy obtained at field reversal. 


3.7.2 Spatial reversal of field - cusps 


Suppose two solenoids with constant currents are arranged coaxially with their magnetic 
fields opposing each other as shown in Fig.3.16(a). Since the solenoid currents are constant, 
the Lagrangian does not depend explicitly on time in which case energy is a constant of the 
motion. Because of the geometrical arrangement, the flux function is anti-symmetric in z 
where z = 0 defines the midplane between the two solenoids. 

Consider a particle injected with initial velocity v =v,92 at z = —L,r = a. Since 
this particle has no initial v; , it simply streams along a magnetic field line. However, 
when the particle approaches the cusp region, the magnetic field lines start to curve causing 
the particle to develop both curvature and grad B drifts perpendicular to the magnetic 
field. When the particle approaches the z = O plane, the drift approximation breaks down 
because B — 0 and so the particle’s motion becomes non-adiabatic [cf. Fig.3.16(a)]. 

Although the particle trajectory is very complex in the vicinity of the cusp, it is still 
possible to determine whether the particle will cross into the positive z half-plane, i.e., 
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cross the cusp. Such an analysis is possible because two constants of the motion exist, 
namely P) and H. The energy 


H=— 24 = t. 3.170 
2m 2m 2mr? ee ( ) 
can be evaluated using 
i q q 
Po = |mr?0+—y T 3.171 
9 dai 27 ]initial 20 = ( ) 
since initially 6 = 0. Here 
Wo = V(r =a,z=—-L) (3.172) 


is the flux at the particle’s initial position. Inserting initial values of all quantities in 
Eq.(3.170) gives 


H = a (3.173) 
2 
and so Eq.(3.170) becomes 
2 
2 2 (+) -ulr z) 
muzo _ mw MU On 0 ? 

2 2,7 2,7 2m? (3.174) 

o mw mv; mv; 

2 2 2 


The extent to which a particle penetrates the cusp can be easily determined if the particle 
starts close enough to r = 0 so that the flux may be approximated as 7 ~ r?. Specifically, 
the flux will be y = B orr? where B.o is the on-axis magnetic field in the z << 0 region. 
The canonical momentum is simply P) = qy)/27 = qBz0a?/2 since the particle started as 
non-axis encircling. 
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Figure 3.16: (a) Cusp field showing trajectory for particle with sufficient initial energy to 
penetrate the cusp; (b) two cusps used as magnetic trap to confine particles. 


Suppose the particle penetrates the cusp and arrives at some region where again Y ~ r?. 
Since the particle is now axis-encircling, the relation between canonical momentum and 
flux is Po = —qu/2n = —q(—B.otr?)/27 = qB-or? /2 from which it is concluded that 
r = a. Thus, if the particle is able to move across the cusp, it becomes an axis-encircling 
particle with the same radius r = a it originally had when it was non-axis-encircling. The 
minimum energy an axis-encircling particle can have is when it is purely axis encircling, 
i.e., has v, = 0 and v, = 0. Thus, for the particle to cross the cusp and reach a location 
where it becomes purely axis-encircling, the particle’s initial energy must satisfy 


mv — m(w,a)* 
Zo ma 2 


(3.175) 


or simply 
Uz0 > Wed. (3.176) 
If vzo is too small to satisfy this relation, the particle reflects from the cusp and returns 
back to the negative z half-plane. Plasma confinement schemes have been designed based 
on particles reflecting from cusps as shown in Fig.3.16(b). Here a particle is trapped be- 
tween two cusps and so long as its parallel energy is insufficient to violate Eq.(3.176), the 
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particle is confined between the two cusps. 

Cusps have also been used to trap relativistic electron beams in mirror fields (Hudgings, 
Meger, Striffler, Destler, Kim, Reiser and Rhee 1978, Kribel, Shinksky, Phelps and Fleischmann 
1974). In this scheme an additional opposing solenoid is added to one end of a magnetic 
mirror so as to form a cusp outside the mirror region. A relativistic electron beam is in- 
jected through the cusp into the mirror. The beam changes from non-axis-encircling into 
axis-encircling on passing through the cusp as in Fig.3.16(a). If energy is conserved, the 
beam is not trapped because the beam will reverse its trajectory and bounce back out of the 
mirror. However, if axial energy is removed from the beam once it is in the mirror, then the 
motion will not be reversible and the beam will be trapped. Removal of beam axial energy 
has been achieved by having the beam collide with neutral particles or by having the beam 
induce currents in a resistive wall. 


3.7.3 Stochastic motion in large amplitude, low frequency waves 


The particle drifts (E x B, polarization, etc.) were derived using an iteration scheme which 
was based on the assumption that spatial changes in the electric and magnetic fields are suf- 
ficiently gradual to allow Taylor expansions of the fields about their values at the gyrocen- 
ter. 

We now examine a situation where the fields change gradually in space relative to the 
initial gyro-orbit dimensions, but the fields also pump energy into the particle motion so 
that eventually the size of the gyro-orbit increases to the point that the smallness assumption 
fails. To see how this might occur consider motion of a particle in an electrostatic wave 


E = ykosin(ky — wt) (3.177) 


which propagates in a plasma immersed in a uniform magnetic field B =Bz. The wave 
frequency is much lower than the cyclotron frequency of the particle in question. This 
w << We condition indicates that the drift equations in principle can be used and so ac- 
cording to these equations, the charged particle will have both an E x B drift 


ExB kod, 
Vg = Re =g sin(ky — wt). (3.178) 
and a polarization drift 
m dE, Ț7 mkọd . 
Vp = TB? dt =y 7B? di sin(ky — wt). (3.179) 


If the wave amplitude is infinitesimal, the spatial displacements associated with v p and 
vp are negligible and so the guiding center value of y may be used in the right hand side of 
Eq.(3.179) to obtain 

wmk 


Vp = -¥ ee cos(ky — wt). (3.180) 


Equations (3.178) and (3.180) show that the combined vg and v, particle drift motion 
results in an elliptical trajectory. 

Now suppose that the wave amplitude becomes so large that the particle is displaced 
significantly from its initial position. Since the polarization drift is in the y direction, there 
will be a substantial displacement in the y direction. Thus, the right side of Eq.(3.179) 
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should be construed as sin [ky(t) — wt] so that, taking into account the time dependence 
of y on the right hand sided, Eq.(3.179) becomes 


an ed egg ss S58 coed 
Vp Yop at thy wt)= ¥ 


mko d mko G = w) cos(ky — wt). (3.181) 


However, dy/dt = vp since v, is the motion in the y direction. Equation (3.181) becomes 
an implicit equation for v, and may be solved to give 


.wmkd cos(ky — wt) 


= 182 
Sp qB? [1— acos(ky — wt)] (3180) 
where ; 
mk op 
a= JB? (3.183) 


is a non-dimensional measure of the wave amplitude (McChesney, Stern and Bellan 1987, 
White, Chen and Lin 2002). 

If œa > 1, the denominator in Eq.(3.182) vanishes when ky — wt = cos™ ta~! and 
this vanishing denominator would result in an infinite polarization drift. However, the 
derivation of the polarization drift was based on the assumption that the time derivative 
of the polarization drift was negligible compared to the time derivative of vg, i.e., it was 
explicitly assumed dv/dt << dvp /dt. Clearly, this assumption fails when v, becomes 
infinite and so the iteration scheme used to derive the particle drifts fails. What is happening 
is that when a ~ 1, the particle displacement due to polarization drift becomes ~ k™t. 
Thus the displacement of the particle from its gyrocenter is of the order of a wavelength. In 
such a situation it is incorrect to represent the its actual location by its gyrocenter because 
the particle experiences the wave field at the particle’s actual location, not at its gyrocenter. 
Because the wave field is significantly different at two locations separated by ~ k~!, it is 
essential to evaluate the wave field evaluated at the actual particle location rather than at 
the gyrocenter. 

Direct numerical integration of the Lorentz equation in this large-amplitude limit shows 
that when a exceeds unity, particle motion becomes chaotic and cannot be described by 
analytic formulae. Onset of chaotic motion resembles heating of the particles since chaos 
and heating both broaden the velocity distribution function. However, chaotic heating is 
not a true heating because entropy is not increased — the motion is deterministic and not 
random. Nevertheless, this chaotic (or stochastic) heating is indistinguishable for practical 
purposes from ordinary collisional thermalization of directed motion. 

An alternate way of looking at this issue is to consider the Lorentz equations for two 
initially adjacent particles, denoted by subscripts 1 and 2 which are in a wave electric field 
and a uniform, steady-state magnetic field (Stasiewicz, Lundin and Marklund 2000). The 
respective Lorentz equations of the two particles are 


dv; 

T = [E(x,,t)+v1 xB] 

dv2 q 

pr = m [E(x,,t)+v1 xB] 5 (3.184) 


Subtracting these two equations gives an equation for the difference between the velocities 
of the two particles, ôv = vı — V2 in terms of the difference 6x = x, —xXg in their positions, 
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i.e., 
dô 
SN 1 [5x- VE + ôv x B] . (3.185) 


The difference velocity is T, to the difference in positions by déx/dt=dv. Let y be 
the direction in which the electric field is non-uniform, i.e., with this choice of coordinate 
system E depends only on the y direction. To simplify the algebra, define Es = qE,/m 
and €, = qE,/m so the components of Eq.(3.185) transverse to the magnetic field are 


OE x . 
64 = dy—-+w.dy 
Oy 
OE 
oy = by—*-w. de. (3.186) 
oy 
Now take the time derivative of the lower equation to obtain 
m OE ð (dé 
0Y = 6y—* +ô u zó .187 
y E v5, (S) Wd (3.187) 
and then substitute for 6% giving 
iia OE ð (dé OE 
Oy = by—* +6 2 We | 6 =+w dy } . 3.188 
ae wy (at) (8-55 =) a 
This can be re-arranged as 
5s 1 0€ oE O (dé 
oY +w2(1 2) dy = w.dy— — ô uy. 1 
í a ( saa e By ila) ee. 


Consider the right hand side of the equation as being a forcing term for the left hand side. 
If wz tE, /Əðy < 1, then the left hand side is a simple harmonic oscillator equation in 
the variable 6y. However, if w-'0E,/Oy exceeds unity, then the left hand side becomes 
an equation with solutions that grow exponentially in time. If two particles are initially 
separated by the infinitesimal distance dy and if w,'0E,/Oy < 1 the separation distance 
between the two particles will undergo harmonic oscillations, but if wz 13E, /ðy > 1 the 
separation distance will exponentially diverge with time. It is seen that œ corresponds to 
wz ‘OE, /dy for a sinusoidal wave. Exponential growth of the separation distance between 
two particles that are initially arbitrarily close together is called stochastic behavior. 


3.8 Motion in small-amplitude oscillatory fields 


Suppose a small-amplitude electromagnetic field exists in a plasma which in addition has a 
large uniform, steady-state magnetic field and no steady-state electric field. The fields can 
thus be written as 


E = E, (x, t) 

B = Bo+Bi(x,t) (3.190) 
where the subscript | denotes the small amplitude oscillatory quantities and the subscript 0 
denotes large, uniform equilibrium quantities. A typical particle in this plasma will develop 


an oscillatory motion 
x(t) = (x(t)) +6x(t) (3.191) 
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where (x(t)) is the particle’s time-averaged position and 6x(t) is the instantaneous devia- 
tion from this average position. If the amplitudes of E; (x, t) and B;(x, ¢) are sufficiently 
small, then the fields at the particle position can be approximated as 


E((x(¢)) +ôx(t) t) = Ex((x()),) 
B((x(t)) +6x(t), t) Bo + By ((x(t)) ,t) (3.192) 


This is the opposite limit from what was considered in Section 3.7.3. The Lorentz equation 
reduces in this small-amplitude limit to 
mS = q[Ex((x) 4) +vx (Bo + B1 (x0) 0). (3.193) 

Since the oscillatory fields are small, the resulting particle velocity will also be small 
(unless there is a resonant response as would happen at the cyclotron frequency). If the 
particle velocity is small, then the term v x Bı (x,t) is of second order smallness, whereas 
E; and v x Bo are of first-order smallness. The v x B,(x,¢) is thus insignificant com- 
pared to the other two terms on the right hand side and therefore can be discarded so that 
the Lorentz equation reduces to 

dv 
mgl [E ((x) ,t) + v x Bol, (3.194) 
a linear differential equation for v. Since ôx is assumed to be so small that it can be 
ignored, the average brackets will be omitted from now on and the first order electric field 
will simply be written as E; (x,t) where x can be interpreted as being either the actual or 
the average position of the particle. 

The oscillatory electric field can be decomposed into Fourier modes, each having time 
dependence ~ exp(—iwt) and since Eq.(3.194) is linear, the particle response to a field 
E; (x,t) is just the linear superposition of its response to each Fourier mode. Thus it is 
appropriate to consider motion in a single Fourier mode of the electric field, say 


I? 


E, (x,t) = E(x,w) exp(—iwt). (3.195) 
If initial conditions are ignored for now, the particle motion can be found by simply assum- 
ing that the particle velocity also has the time dependence exp(—iwt) in which case the 
Lorentz equation becomes 


—iwmv = q |E(x) +v x Bo (3.196) 


where a factor exp(—iwt) is implicitly assumed for all terms and also an w argument is 
implicitly assumed for E. Equation (3.196) is a vector equation of the form 


v+vxA=C (3.197) 
where F 
A= 
1W 
B 
gee (3.198) 
m 
C=- LĒ) 
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and the z axis has been chosen to be in the direction of By. Equation (3.197) can be solved 
for v by first dotting with A to obtain 


A-v=C-A (3.199) 
and then crossing with A to obtain 


vxA+AA-v—vA?7=CxA. (3.200) 
Substituting for A - v using Eq.(3.199) and for v x A using Eq.(3.197) gives 


C+AA-C-CxrA Cs Cı AxC 

14 A? IT IFAÆ TAR 

where C has been split into parallel and perpendicular parts relative to Bo and AA - C 
=A’C| z has been used. On substituting for A and C this becomes 


(3.201) 


iq | ~ E, (x) iwe 2 x E(x) 


=— |E z 4 
i wm a 1 — w2/w? w 1— w/w? 


ae (3.202) 


The third term on the right hand side is a generalization of the E x B drift, since for 
w << we this term reduces to the E x B drift. Similarly, the middle term on the right hand 
side is a generalization of the polarization drift, since for w << we this term reduces to the 
polarization drift. The first term on the right hand side, the parallel quiver velocity, does 
not involve the magnetic field Bo. This non-dependence on magnetic field is to be expected 
because no magnetic force results from motion parallel to a magnetic field. In fact, if the 
magnetic field were zero, then the second term would add to the first and the third term 
would vanish, giving a three dimensional unmagnetized quiver velocity v = iqE(x) /wm. 
If the electric field is in addition decomposed into spatial Fourier modes with depen- 
dence ~ exp(ik - x), then the velocity for a typical mode will be 
i aes E, iwe 2xE ik-x—iw 
v(x) =- erage ee gaa (3.203) 


The convention of a negative coefficient for w and a positive coefficient for k has been 
adopted to give waves propagating in the positive x direction. Equation (3.203) will later 
be used as the starting point for calculating wave-generated plasma currents. 


3.9 Wave-particle energy transfer 


3.9.1 ‘Average velocity’ 


Anyone who has experienced delay in a traffic jam knows that it is usually impossible to 
make up for the delay by going faster after escaping from the traffic jam. To see why, 
define a as the fraction of the total trip length in the traffic jam, vs as the slow (traffic jam) 
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speed, and v; as the fast speed (out of traffic jam). It is tempting, but wrong, to say that the 
average velocity is (1 — a)vy + av, because 


total distance 


average velocity of a trip = (3.204) 


total time 
Since the fast-portion duration is ty = (1 — a)L/vy while the the slow-portion duration is 
ts = aL /vs, the average velocity of the complete trip is 
L 1 
Vig = = : 
 (1—a)L/vs+aL/vs (1—a)/vs +a/vs 

Thus, if vs << vy then Vavg = Vs /œ which (i) is not the weighted average of the fast and 
slow velocities and (ii) is almost entirely determined by the slow velocity. 


(3.205) 


3.9.2 Motion of particles in a sawtooth potential 


The exact motion of a particle in a sinusoidal potential can be solved using elliptic integrals, 
but the obtained solution is implicit, i.e., the solution is expressed in the form of time 
as a function of position. While exact, the implicit nature of this solution obscures the 
essential physics. In order to shed some light on the underlying physics, we will first 
consider particle motion in the contrived, but analytically tractable, situation of the periodic 
sawtooth-shaped potential shown in Fig.3.17 and then later will consider particle motion in 
a more natural, but harder to analyze, sinusoidal potential. 

When in the downward-sloping portion of the sawtooth potential, a particle experiences 
a constant acceleration +a and when in the upward portion it experiences a constant accel- 
eration —a. Our goal is to determine the average velocity of a group of particles injected 
with an initial velocity vo into the system. Care is required when using the word ‘average’ 
because this word has two meanings depending on whether one is referring to the average 
velocity of a single particle or the average velocity of a group of particles. The average ve- 
locity of a single particle is defined by Eq.(3.204) whereas the average velocity of a group 
of particles is defined as the sum of the velocities of all the particles in the group divided 
by the number of particles in the group. 

The average velocity of any given individual particle depends on where the particle 
was injected. Consider the four particles denoted as A, B, C, and D in Fig.3.17 as rep- 
resentatives of the various possibilities for injection location. Particle A is injected at a 
potential maximum, particle C’ at a potential minimum, particle B is injected half way on 
the downslope, and particle D is injected half way on the upslope. 


Figure 3.17: Initial positions of particles A,B,C’, and D. All are injected with same initial 
velocity vp, moving to the right. 
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The average velocity for each of these four representative particles will now be evalu- 
ated: 

Particle A — Let the distance between maximum and minimum potential be d. Let x = 0 
be the location of the minimum so the injection point is at x = —d. Thus the trajectory on 
the downslope is 

x(t) = —d + vot + at? /2 (3.206) 
and the time for particle A to go from its injection point to the potential minimum is found 
by setting z(t) = 0 giving 


‘a= m (-1 +vVI+ 25 ) (3.207) 


where 6 = ad /vē is the normalized acceleration. When particle A reaches the next poten- 
tial peak, it again has velocity vo and if the time and space origins are re-set to be at the 
new peak, the trajectory will be 


a(t) = vot — at? /2. (3.208) 


The negative time when the particle is at the preceding potential minimum is found from 


—d = vot — at? /2. (3.209) 


Solving for this negative time and then calculating the time increment to go from the mini- 
mum to the maximum shows that this time is the same as going from the maximum to the 
minimum, i.e., 44. = te Thus the average velocity for particle A is 


A w 
9 14+ VIF 26— 
The average velocity of particle A is thus always faster than its injection velocity. 


Particle C — Now let x = 0 be the location of maximum potential and x = —d be the 
point of injection so the particle trajectory is 


v (3.210) 


a(t) = —d + vot — at? /2 (3.211) 


and the time to get to x = Ois 


= (1 = v1—28) (3.212) 


From symmetry it is seen that the time to go from the maximum to the minimum will be 
the same so the average velocity will be 


Co ad/vo 
wg 1- V1 = 26 - 
Because particle B is always on a potential hill relative to its injection position, its average 
velocity is always slower than its injection velocity. 

Particles B and D- Particle B can be considered as first traveling in a potential well and 
then in a potential hill, while the reverse is the case for particle D. For the potential well 


v (3.213) 
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portion, the forces are the same, but the distances are half as much, so the time to traverse 
the potential well portion is 


2 
AE = (-1 cart sie 5) i (3.214) 


Similarly, the time required to traverse the potential hill portion will be 


2 
Dans a (1 EN 5 ) (3.215) 


so the average velocity for particles B and D will be 


cae ad/vo 
Ue = : 3.216 
ee ee A oe 
These particles move slower than the injection velocity, but the effect is second order in 6. 
The average velocity of the four particles will be 


Cage = 5 va UE tuS HoR) (3.217) 


ad 1 1 2 
Avo Er VI+28 1-v1-28 vVI+ô-— exit 


If ô is small this expression can be approximated as 


ad 1 1 2 
w E ' ' 3.218 
“ang 4500 Ferrer 14+6/2+ 67/2 Fal AaS 
_ ad Hie e i 
~ 28v [14+367/4 © 1467/8 


2 


~ vo [1—367/16] 


so that the average velocity of the four representative particles is smaller than the injection 
velocity. This effect is second order in 6 and shows that a group of particles injected 
at random locations with identical velocities into a sawtooth periodic potential will, on 
average, be slowed down. 


3.9.3 Slowing down, energy conservation, and average velocity 


The sawtooth potential analysis above shows that is necessary to be very careful about what 
is meant by energy and average velocity. Each particle individually conserves energy and 
regains its injection velocity when it returns to the phase at which it was injected. However, 
the average velocities of the particles are not the same as the injection velocities. Particle A 
has an average velocity higher than its injection velocity whereas particles B, C and D have 
average velocities smaller than their respective injection velocities. The average velocity of 
all the particles is less than the injection velocity so that the average kinetic energy of the 
particles is reduced relative to the injection kinetic energy. Thus the average velocity of a 
group of particles slows down in a periodic potential, yet paradoxically individual particles 
do not lose energy. The energy that appears to be missing is contained in the instantaneous 
potential energy of the individual particles. 
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3.9.4 Wave-particle energy transfer in a sinusoidal wave 


The calculation will now redone for the physically more relevant situation where a group 
of particles interact with a sinusoidal wave. As a prerequisite for doing this calculation it 
must first be recognized that two distinct classes of particles exist, namely those which are 
trapped in the wave and those which are not. The trajectories of trapped particles differs 
in a substantive way from untrapped particles, but for low amplitude waves the number of 
trapped particles is so small as to be of no consequence. It therefore will be assumed that 
the wave amplitude is sufficiently small that the trapped particles can be ignored. 

Particle energy is conserved in the wave frame but not in the lab frame because the 
particle Hamiltonian is time-independent in the wave frame but not in the lab frame. Since 
each additional conserved quantity reduces the number of equations to be solved, it is 
advantageous to calculate the particle dynamics in the wave frame, and then transform 
back to the lab frame. 

The analysis in Sec.3.9.2 of particle motion in a sawtooth potential showed that ran- 
domly phased groups of particles have their average velocity slow down, i.e., the average 
velocity of the group tends towards zero as observed in the frame of the sawtooth potential. 
If the sawtooth potential were moving with respect to the lab frame, the sawtooth poten- 
tial would appear as a propagating wave in the lab frame. A lab-frame observer would see 
the particle velocities tending to come to rest in the sawtooth frame, i.e., the lab-frame av- 
erage of the particle velocities would tend to converge towards the velocity with which the 
sawtooth frame moves in the lab frame. 

The quantitative motion of a particle in a one-dimensional wave potential d(z,t) = 
po cos(ka — wt) will now be analyzed in some detail. This situation corresponds to a 
particle being acted on by a wave traveling in the positive x direction with phase velocity 
w/k. It is assumed that there is no magnetic field so the equation of motion is simply 


dv = qkoo 
dt m 

At t = 0 the particle’s position is x = xo and its velocity is v = vo. The wave phase at the 
particle location is defined to be Yy = kx — wt. This is a more convenient variable than x 
and so the differential equations for x will be transformed into a corresponding differential 
equation for 7. Using 4 as the dependent variable corresponds to transforming to the wave 
frame, i.e., the frame moving with the phase velocity w/k, and makes it possible to take 
advantage of the wave-frame energy being a constant of the motion. The equations are less 
cluttered with minus signs if a slightly modified phase variable 0 = ka — wt — 7 is used. 

The first and second derivatives of 0 are 


sin(ka — wt). (3.219) 


dé 
an ku — w (3.220) 
and 
d?6 du 
-z = Rae (3.221) 
Substitution of Eq.(3.221) into Eq.(3.219) gives 
d0 k? 
EE E (3.222) 


dt? 
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By defining the bounce frequency 


k2 
ua i, (3.223) 
m 


and the dimensionless bounce-normalized time 
T = wot, (3.224) 
Eq. (3.222) reduces to the pendulum-like equation 


d?6 
T + sin = 0. (3.225) 


Upon multiplying by the integrating factor 2d@/dr, Eq.(3.225) becomes 


a | (ao\* 
T (2) — 2 cos j =0. (3.226) 
This integrates to give 
d9 \? 
2cos 6 = à = const. (3.227) 
dr 


which indicates the expected energy conservation in the wave frame. The value of A is 
determined by two initial conditions, namely the wave-frame injection velocity 


dé 1 (dé kug — Ww (3.228) 
ar) o 2s db der w 5 i 


and the wave-frame injection phase 


0-=0 = kto — T = bo. (3.229) 
Inserting these initial values in the left hand side of Eq. (3.227) gives 
à = a? — 2 cos bo. (3.230) 
Except for a constant factor, 
e is the total wave-frame energy 
e (dé/dr)? is the wave-frame kinetic energy 


e  —2cos 0 is the wave-frame potential energy. 


If —2 < A < 2, then the particle is trapped in a specific wave trough and oscillates 
back and forth in this trough. However, if A > 2, the particle is untrapped and travels 
continuously in the same direction, speeding up when traversing a potential valley and 
slowing down when traversing a potential hill. 

Attention will now be restricted to untrapped particles with kinetic energy greatly ex- 
ceeding potential energy. For these particles 


a S32 (3.231) 
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which corresponds to considering small amplitude waves since a ~ wr! and wy, ~ Vto ; 

We wish to determine how these untrapped particles exchange energy with the wave. 
To accomplish this the lab-frame kinetic energy must be expressed in terms of wave-frame 
quantities. From Eqs.(3.220) and (3.224) the lab-frame velocity is 


1 dé wy [ w dé 
= = 3.232 
=k (e T) k (= T) eana) 
so that the lab-frame kinetic energy can be expressed as 
1 > mw? eye w dé do\? 
W=- = H2 } ; 2 
27” = “one (e) wedr ` \dr oa?) 
Substituting for (d@/d7)? using Eq.(3.227) gives 
mw? eye w dé 
w=— + 2 + \+2cos 6] . 3.234 
2k? (=) wy dr om ( ) 


Since wave-particle energy transfer is of interest, attention is now focused on the changes 
in the lab-frame particle kinetic energy and so we consider 


dw mw? w d0 dé. 9 
eke, Be sin 
dt k2 |uw,dr?2 dr 
w? dé 
_ ~~ sino |= id (3.235) 


where Eq.(3.225) has been used. To proceed further, it is necessary to obtain the time 
dependence of both sin 0 and d@/dt. 


Solving Eq.(3.227) for d0 /dr and assuming a >> 1 (corresponding to untrapped par- 
ticles) gives 
dé 


I = +VA+2cos0 
= 


= +,/a? + 2(cos 6 — cos 40) 
(3.236) 


5 (1 i a 


az 


cos 0 — cos ĝo 
SS: Os 
This expression is valid for both positive and negative a, i.e. for particles going in either 
direction in the wave frame. The first term in the last line of Eq. (3.236) gives the velocity 
the particle would have if there were no wave (unperturbed orbit) while the second term 
gives the perturbation due to the small amplitude wave. The particle orbit 6(7) is now 
solved for iteratively. To lowest order (i.e., dropping terms of order œ~?) the particle 
velocity is 
do 


ane (3.237) 
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and so the rate at which energy is transferred from the wave to the particles is 


dw mw? no w 5 
a T Te rillaaess 
F 
= me sin 0 (3.238) 
Integration of Eq.(3.237) gives the unperturbed orbit solution 
OIT) = 00 + aT. (3.239) 


This first approximation is then substituted back into Eq.(3.236) to get the corrected form 


dé =e cos(09 + aT) — cos ĝo (3.240) 
dr a 


which may be integrated to give the corrected phase 


Cate a a7) AT (3.241) 
a a 
From Eq.(3.241) we may write 
sin = sin[(09 + aT) + A(7)] (3.242) 
where Nar tj 
A(T) = Mot o] TELA Eog bo (3.243) 


a 
is the ‘perturbed-orbit’ correction to the phase. If consideration is restricted to times where 
T << |al, the phase correction A(T) will be small. This restriction corresponds to 


(wet)? << |kvo — wt (3.244) 
which means that the number of wave peaks the particle passes greatly exceeds the number 
of bounce times. Since bounce frequency is proportional to wave amplitude, this condition 


will be satisfied for all finite times for an infinitesimal amplitude wave. Because A is 
assumed to be small, Eq.(3.242) may be expanded as 


sin 0 = sin(@)+aT) cos A+sin A cos(09+aT) ~ sin(99 +aT)+A cos(fo+arT) (3.245) 

so that Eq. (3.238) becomes 

dw B mw2vo 

d k 
The wave-to-particle energy transfer rate depends on the particle initial position. This 
is analogous to the earlier sawtooth potential analysis where it was shown that whether 
particles gain or lose average velocity depends on their injection phase. It is now assumed 
that there exist many particles with evenly spaced initial positions and then an averaging 
will be performed over all these particles which corresponds to averaging over all initial 
injection phases. Denoting such averaging by () gives 


dw B mw2vo 
(Z) = TE (A cos(ĝo + aT)) 


[sin(@9 + aT) + A cos(fo + a7)]. (3.246) 


ay? i — si 
meso ( jane + ar) —sin bo x cos 6 cos(89 + aT) 


(3.247) 


a2 


118 Chapter 3. Motion of a single plasma particle 


Using the identities 
(sin(09 + aT) cos(99 + aT)) = 0 
(sin 09 cos(9) + aT)) = —$ sin ar (3.248) 
(cos 09 cos(09 + aT)) = £ cosat 


2 
the wave-to-particle energy transfer rate becomes 


dw 2 mw?vo sinaT T ern hes mw?vo d /sinat i (3.249) 
dt 2k ae a 2k da a 
At this point it is recalled that one representation for a delta function is 
in( N 
Cys te (3.250) 
N- co TZ 
so that for |ar| >> 1 Eq.(3.249) becomes 
dw TMwW?V9 d 
= óla). 3.251 
( dt ) oe dae mae 


Since 6(z) has an infinite positive slope just to the left of z = 0 and an infinite negative 
slope just to the right of z = 0, the derivative of the delta function consists of a positive 
spike just to the left of z = O and a negative spike just to the right of z = 0. Furthermore 
a = (kvo — w)/wy is slightly positive for particles moving a little faster than the wave 
phase velocity and slightly negative for particles moving a little slower. Thus (dW/dt) 
is large and positive for particles moving slightly slower than the wave, while it is large 
and negative for particles moving slightly faster. If the number of particles moving slightly 
slower than the wave equals the number moving slightly faster, the energy gained by the 
slightly slower particles is equal and opposite to that gained by the slightly faster particles. 

However, if the number of slightly slower particles differs from the number of slightly 
faster particles, there will be a net transfer of energy from wave to particles or vice versa. 
Specifically, if there are more slow particles than fast particles, there will be a transfer of 
energy to the particles. This energy must come from the wave and a more complete analysis 
(cf. Chapter 5) will show that the wave damps. The direction of energy transfer depends 
critically on the slope of the distribution function in the vicinity of v = w/k, since this 
slope determines the ratio of slightly faster to slightly slower particles. 

We now consider a large number of particles with an initial one-dimensional distribution 
function f(vo) and calculate the net wave-to-particle energy transfer rate averaged over 
all particles. Since f(vo)dvo is the probability that a particle had its initial velocity between 
vo and vo + dvo, the energy transfer rate averaged over all particles is 


AWirotal nmw?vo d 
—— = d ——_°— — ó 
( EP ) J vof (vo) I A (a) 


3 
TMW, d kvo — w 
2k2 J vo f (vo)vo doo ( os ) 


4 ( 
= Mh [ tofo (vo — ~) 


2k3 dvo k 
4 
nmw; |d 
= — : 3.252 
2k3 E (F(en)e) aui ( ) 
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If the distribution function has the Maxwellian form f ~ exp(—vj/2v2.) where vr is the 
thermal velocity, and if w/k >> ur then 


d d 
ioo] = [ogi rood) + F000) 
‘Vo vo=w/k Vo vo=w/k 
võ 
= |-2—>-f(vo) + f (vo) (3.253) 
VT: vo=w/k 
showing that the derivative of f is the dominant term. Hence, Eq.(3.252) becomes 
AWotal nmwtw d 
= .254 
( dt ) 2k4 dvo (f(vo)) MEN (3.254) 


Substituting for the bounce frequency using Eq.(3.223) this becomes 


Co _ amy (a) | d (se) l (3.255) 


dt 2k? m dvo UIR 


Thus particles gain kinetic energy at the expense of the wave if the distribution function 
has negative slope in the range v ~ w/k. This process is called Landau damping and will 
be examined in the Chapter 5 from the wave viewpoint. 


3.10 Assignments 


1. A charged particle starts from rest in combined static fields E = Ey and B =B 2 where 
E/B << cand c is the speed of light. Calculate and plot its exact trajectory (do this 
both analytically and numerically). 


2. Calculate (qualitatively and numerically) the trajectory of a particle starting from rest 
at x = 0, y = 5a in combined F and B fields where E =Fo# and B =ZBoy/a. What 
happens to u conservation on the line y = 0? Sketch the motion showing both the 
Larmor motion and the guiding center motion. 


3. Calculate the motion of a particle in the steady state electric field produced by a line 
charge À along the z axis and a steady state magnetic field B =BọŻ. Obtain an approx- 
imate solution using drift theory and also obtain a solution using Hamilton-Lagrange 
theory. Hint -for the drift theory show that the electric field has the form E =fFA/2rr. 
Assume that \ is small for approximate solutions. 


4. Consider the magnetic field produced by a toroidal coil system; this coil consists of 
a single wire threading the hole of a torus (donut) N times with the NV turns evenly 
arranged around the circumference of the torus. Use Ampere’s law to show that the 
magnetic field is in the toroidal direction and has the form B = uN I /2rr where N 
is the total number of turns in the coil and J is the current through the turn. What are 
the drifts for a particle having finite initial velocities both parallel and perpendicular 
to this toroidal field. 
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Show that of all the standard drifts (E x B, VB, curvature, polarization) only the 
polarization drift causes a change in the particle energy. Hint: consider what happens 
when the following equation is dotted with v: 


ie ee 


. Use the numerical Lorentz solver to calculate the motion of a charged particle in a 


uniform magnetic field B = B2 and an electric field given by Eq.(3.177). Compare 
the motion to the predictions of drift theory (E x B, polarization). Describe the motion 
for cases where a << 1, œ œ 1, and œ >> 1 where a = mk?¢/qB?. Describe what 
happens when a becomes of order unity. 


. A “magnetic mirror” field in cylindrical coordinates r, 0, z can be expressed as B = 


(27) 1Vw x VO where y = Bowr?(1 + (z/L)*) where L is a characteristic length. 
Sketch by hand the field line pattern in the r, z plane and write out the components of 
B. What are appropriate characteristic lengths, times, and velocities for an electron 
in this configuration? Use r = (a? + y?)1/? and numerically integrate the orbit of an 
electron starting at x = 0, y = L, z = 0 with initial velocity v = 0 and initial v4, vz 
of the order of the characteristic velocity (try different values). Simultaneously plot 
the motion in the z, y plane and in the x,y plane. What interesting phenomena can be 
observed (e.g., reflection)? Does the electron stay on a constant % contour? 


. Consider the motion of a charged particle in the magnetic field 


1 
B = —Vv(r,z,t) x VO 


2T 
where 
2 GF 
p(r,z, t) = Buamtr* |1 + 2A— 
V( ) | æ rf -| 
and z 
C= Li) 


Show by explicit evaluation of the flux derivatives and also by plotting contours of 
constant flux that this is an example of a magnetic mirror field with minimum axial 
field Bmin when z = O and maximum axial field ABmin at z = L(t). By making 
L(t) a slowly decreasing function of time show that the magnetic mirrors slowly move 
together. Using numerical techniques to integrate the equation of motion, demonstrate 
Fermi acceleration of a particle when the mirrors move slowly together. Do not forget 
the electric field associated with the time-changing magnetic field (this electric field 
is closely related to the time derivative of ~(r,z,t); use Faraday’s law). Plot the 
velocity space angle at z = O for each bounce between mirrors and show that the 
particle becomes detrapped when this angle decreases below Oirap = sin ~'(A~ 1), 


. Consider a point particle bouncing with nominal velocity v between a stationary wall 


and a second wall which is approaching the first wall with speed u.Calculate the 
change in speed of the particle after it bounces from the moving wall (hint: do this first 
in the frame of the moving wall, and then translate back to the lab frame). Calculate Ta 
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the time for the particle to make one complete bounce between the walls if the nomi- 
nal distance between walls is L. Calculate AL, the change in L during one complete 
bounce and show that if u << v, then Lv is a conserved quantity. By considering 
collisionless particles bouncing in a cube which is slowly shrinking self-similarly in 
three dimensions show that PV°/? is constant where P = nT’, n is the density of 
the particles and T is the average kinetic energy of the particles. What happens if the 
shrinking is not self-similar (hint: consider the effect of collisions and see discussion 
in Bellan (2004a)). 


10. Using numerical techniques to integrate the equation of motion illustrate how a charged 
particle changes from being non-axis-encircling to axis-encircling when a magnetic 
field B =(27)~!V u(r, z,t) x VO reverses polarity at t = 0. For simplicity use Y = 
B(t)rr?, i.e., a uniform magnetic field. To make the solution as general as possi- 
ble, normalize time to the cyclotron frequency by defining T = wet, and set B(T) = 
tanhT to represent a polarity reversing field. Normalize lengths to some reference 
length L and normalize velocities to wL. Show that the canonical angular momen- 
tum is conserved. Hint - do not forget about the inductive electric field associated with 
a time-dependent magnetic field. 


1 


. Consider a cusp magnetic field given by B =(27)~! V(r, z) x V8 where the flux 
function 
2 z 
y(r, z) = Bar*———.. 

1+ 22/a? 

is antisymmetric in z. Plot the surfaces of constant flux. Using numerical techniques to 
integrate the equation of motion demonstrate that a particle incident at z <<< —a and 
r = ro with incident velocity v =v,92 will reflect from the cusp if vzo < rowe where 
we = qB/m. 


12. Consider the motion of a charged particle starting from rest in a simple one dimen- 
sional electrostatic wave field: 


d?a 
m —qV (x,t) 


where (x,t) = ġcos(kxz — wt). How large does have to be to give trapping of 
particles that start from rest. Demonstrate this trapping threshold numerically. 


13. Prove Equation (3.218). 
14. Prove that 


is a valid representation for the delta function. 


1 


Nn 


. As sketched in Fig.3.18, a current loop (radius r, current J) is located in the x — y 
plane; the loop’s axis defines the z axis of the coordinate system, so that the center 
of the loop is at the origin. The loop is immersed in a non-uniform magnetic field 
B produced by external coils and oriented so that the magnetic field lines converge 
symmetrically about the z-axis. The current J is small and does not significantly 
modify B. Consider the following three circles: the current loop, a circle of radius 
b coaxial with the loop but with center at z = —L/2 and a circle of radius a with 
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center at z = +L/2. The radii a and b are chosen so as to intercept the field lines that 
intercept the current loop (see figure). Assume the figure is somewhat exaggerated so 
that B, is approximately uniform over each of the three circular surfaces and so one 
may ignore the radial dependence of B, and therefore express B, = B,(z). 

(a) Note that r = (a + b)/2. What is the force (magnitude and direction) on the current 
loop expressed in terms of J, B,(0),a, b and L only? [Hint- use the field line slope to give 
a relationship between B, and B, at the loop radius.] 

(b) For each of the circles and the current loop, express the magnetic flux enclosed in 
terms of B, at the respective entity and the radius of the entity. What is the relationship 
between the B,’s at these three entities? 

(c) By combining the results of parts (a) and (b) above and taking the limit L — 0, 
show that the force on the loop can be expressed in terms of a derivative of B,. 


y axis 


b 


B 
| 1 circle 
> Z axis 
circle 7 B 
(edge view) 
magnetic 
field 
line 


— L/2 —+—L/2 — 


Figure 3.18: Non-uniform magnetic field acting on a current loop. 
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Elementary plasma waves 


4.1 General method for analyzing small amplitude waves 


All plasma phenomena can be described by combining Maxwell’s equations with the Lorentz 
equation where the Lorentz equation is represented by the Vlasov, two-fluid or MHD ap- 
proximations. The subject of linear plasma waves provides a good introduction to the study 
of plasma phenomena because linear waves are relatively simple to analyze and yet demon- 
strate many of the essential features of plasma behavior. 

Linear analysis, a straightforward method applicable to any set of partial differential 
equations describing a physical system, reveals the physical system’s simplest non-trivial, 
self-consistent dynamical behavior. In the context of plasma dynamics, the method is as 
follows: 

1. By making appropriate physical assumptions, the general Maxwell-Lorentz system of 
equations is reduced to the simplest set of equations characterizing the phenomena 
under consideration. 


2. An equilibrium solution is determined for this set of equations. The equilibrium might 
be trivial such that densities are uniform, the plasma is neutral, and all velocities are 
zero. However, less trivial equilibria could also be invoked where there are density 
gradients or flow velocities. Equilibrium quantities are designated by the subscript 0, 
indicating ‘zero-order’ in smallness. 


3. If f,g,h,etc. represent the dependent variables and it is assumed that a specific pertur- 
bation is prescribed for one of these variables, then solving the system of differential 
equations will give the responses of all the other dependent variables to this prescribed 
perturbation. For example, suppose that a perturbation €f, is prescribed for the depen- 
dent variable f so that f becomes 


f=foteft. (4.1) 


The system of differential equations gives the functional dependence of the other 
variables on f, and for example, would give g = g(f) = g(fo + fi). Since 
the functional dependence of g on f is in general nonlinear, Taylor expansion gives 
g = go + €g1 + ego + e293 +.... The €s are, from now on, considered implicit and 
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so the variables are written as 


f = fo T fi 
g = go+gı +92 +... 
h = hothythot.... (4.2) 


and it is assumed that the order of magnitude of fı is smaller than the magnitude of fo 
by a factor €, etc. The smallness of the perturbation is an assumption which obviously 
must be satisfied in the real situation being modeled. Note that there is no fz or higher 
f terms because the perturbation to f was prescribed as being fi. 


4. Each partial differential equation is re-written with all dependent quantities expanded 
to first order as in Eq.(4.2). For example, the two fluid continuity equation becomes 


alno +n: 
s + V- [(no +n) (uo + u1)] = 0. (4.3) 
By assumption, equilibrium quantities satisfy 
an 
T + V- (nouo) = 0. (4.4) 


The essence of linearization consists of subtracting the equilibrium equation (e.g., 
Eq.(4.4)) from the expanded equation (e.g., Eq.(4.3)). For this example such a sub- 


traction yields 

On, 

— + V- [niuo + nou + nu] = 0. (4.5) 
The nonlinear term nu; which is a product of two first order quantities is discarded 
because it is of order € whereas all the other terms are of order e. What remains 
is called the linearized equation, i.e., the equation which consists of only first-order 
terms. For the example here, the linearized equation would be 


— + V- [niuo + nou] = 0. 


The linearized equation is in a sense the differential of the original equation. 
Before engaging in a methodical study of the large variety of waves that can propagate 
in a plasma, a few special cases of fundamental importance will first be examined. 


4.2 Two-fluid theory of unmagnetized plasma waves 


The simplest plasma waves are those described by two-fluid theory in an unmagnetized 
plasma, i.e., a plasma which has no equilibrium magnetic field. The theory for these waves 
also applies to magnetized plasmas in the special situation where all fluid motions are 
strictly parallel to the equilibrium magnetic field because fluid flowing along a magnetic 
field experience no u x B force and so behaves as if there were no magnetic field. 

The two-fluid equation of motion corresponding to an unmagnetized plasma is 


du, 
Mone =Z = qon, E- VP, (4.6) 


and these simple plasma waves are found by linearizing about an equilibrium where uso = 
0, Eo = 0, and P,o are all constant in time and uniform in space. The linearized form of 
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Eq. (4.6) is then 

Ou, 
ot 
The electric field can be expressed as 


Monog i = qonooE: = VPs- (4.7) 


OA 
Ot 2 
a form that automatically satisfies Faraday’s law. The vector potential A is undefined with 
respect to a gauge since B = V x (A+ Vw) = V x A. It is convenient to choose w so as to 
have V-A = 0. This is called Coulomb gauge and causes the divergence of Eq.(4.8) to give 
Poisson’s equation so that charge density provides the only source term for the electrostatic 
potential ġ. Since Eq. (4.8) is linear to begin with, its linearized form is just 


E=-—V¢- (4.8) 


ðA: 
E, = -V¢, — a (4.9) 


4.2.1 Electrostatic (compressional caves) 


These waves are characterized by having finite V - u; and are variously called compres- 
sional, electrostatic, or longitudinal waves. The first step in the analysis is to take the 
divergence of Eq.(4.7) to obtain 
OV - Us 

moneo — = = qon V p — V? Pa. (4.10) 
Because Eq.(4.10) involves three variables (i.e., us1, 1, Ps1) two more equations are re- 
quired to provide a complete description. One of these additional equations is the linearized 
continuity equation 

aNs 


+noV- ug, =0 (4.11) 

which, after substitution into Eq.(4.10), gives 

Bno 
At 


For adiabatic processes the pressure and density are related by 


= GoNcoV", + V? Pr. (4.12) 


Mo 


P, 
— = const. (4.13) 
No 


where y = (N + 2)/N and N is the dimensionality of the system, whereas for isothermal 
processes 


Po 
— = const. (4.14) 
n 


The same formalism can therefore be used for both isothermal and adiabatic processes 
by using Eq. (4.13) for both and then simply setting y = 1 if the process is isothermal. 
Linearization of Eq.(4.13) gives 

P ol Nol 

DE Sap Od 4.15 

P, o0 Taco ( ) 
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so Eq. (4.12) becomes 


O7?n61 
At 
where P o = NgokT 50 has been used. 

Although this system of linear equations could be solved by the formal method of 
Fourier transforms, we instead take the shortcut of making the simplifying assumption 
that the linear perturbation happens to be a single Fourier mode. Thus, it is assumed that 
all linearized dependent variables have the wave-like dependence 


Mo = doNeoV Qi e KTV" No (4.16) 


Nei ~ explik - x—iwt), pı ~ exp(ik-x—iwt), etc. (4.17) 


so that V — ik and 0/dt — —iw. Equation (4.16) therefore reduces to the algebraic 
equation 


Mow Nor = doNcok? Qı te KT 50k? No 1 (4.18) 
which may be solved for n, to give 
dood k?p 
c = ; 4.19 
A Mo (w? — yk?kToo/Mo) ( ) 
Poisson’s equation provides another relation between ¢, and n,1, namely 
1 
E E E 4.20 
nme 3 Nord (4.20) 
Equation (4.19) is substituted into Poisson’s equation to give 
2 2 
No0q, k Qı 
a = z 4.21 
£ 2 €9Mg (w? — yk?kToo/Mo) (an 
which may be re-arranged as 
W 
1 ; ape =0 4.22 
>, (w? — yk?kToo/Mo) $ Ge) 
where > 
2 a Paoa (4.23) 


Po = coms 
is the square of the plasma frequency of species o. A useful way to recast Eq.(4.22) is 


(L+xe+xi)¢, =0 (4.24) 


where 


w2 


= po 

Xom (w2 — yk?KTo0/Mo) G23 

is called the susceptibility of species ø. In Eq.(4.24) the “1” comes from the “vacuum” part 
of Poisson’s equation (i.e., the LHS term V0) while the susceptibilities give the respective 
contributions of each species to the right hand side of Poisson’s equation. This formalism 
follows that of dielectrics where the displacement vector is D =cE and the dielectric con- 


stant is € = 1 + x where x is a susceptibility. 
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Equation (4.24) shows that if ¢, # 0, the quantity 1 + Xe + x, must vanish. In other 
words, in order to have a non-trivial normal mode it is necessary to have 


Pag +x; =0. (4.26) 


This is called a dispersion relation and prescribes a functional relation between w and k. 
The dispersion relation can be considered as the determinant-like equation for the eigen- 
values w(k) of the system of equations. 

The normal modes can be identified by noting that Eq.(4.25) has two limiting behaviors 
depending on how the wave phase velocity compares to \/KT,0/m,, a quantity which is 
of the order of the thermal velocity. These limiting behaviors are 

1. Adiabatic regime: w/k >> ./KT50/m, and y = (N + 2)/N. Because plane waves 
are one-dimensional perturbations (i.e., the plasma is compressed in the k direction 
only), N = 1 so that y = 3. Hence the susceptibility has the limiting form 


nee 
Woo 


w?(1 — yk?KT59/mow?) 


(4.27) 


2. Isothermal regime: w/k << \/KTz0/m, and y = 1. Here the susceptibility has the 
limiting form 


2 
we 1 


7 k?kTo0/Mo 7 kA o 


Xg (4.28) 


Figure 4.1 shows a plot of X, k? Ars versus w/k/KTo0/Mo. The isothermal and adia- 
batic susceptibilities are seen to be substantially different and, in particular, do not coalesce 
when w/k\/KT50/m, — 1. This non-coalescence as w/k,/KT,0/mM, — 1 indicates that 
the fluid description, while valid in both the adiabatic and isothermal limits, fails in the 
vicinity of w/k ~ y/KTs0/Mo. As will be seen later, the more accurate Vlasov descrip- 


tion must be used in the w/k ~ 4/KTo0/Mo regime. 
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Figure 4.1: Susceptibility x as a function of w/k y/ KTs0/Mo. 


Since the ion-to-electron mass ratio is large, ions and electrons typically have thermal 
velocities differing by at least one and sometimes two orders of magnitude. Furthermore, 
ion and electron temperatures often differ, again allowing substantially different electron 
and ion thermal velocities. Three different situations can occur in a typical plasma de- 
pending on how the wave phase velocity compares to thermal velocities. These situations 
are: 


1. Case where w/k >> VKT20/Me, VkKTio/m; 


Here both electrons and ions are adiabatic and the dispersion relation becomes 


We k? KT, Wri k? KT, 
1 s (1+3 aa) S (i3 sat) =o (4.29) 
w w? Me w w m 
Since We / Ww? = m;/m, the ion contribution can be dropped, and the dispersion 
becomes 
wee i KT 0 
1 z bee = 0. (4.30) 
we? WÀ Me 


To lowest order, the solution of this equation is simply w? = We An iterative solution 
may be obtained by substituting this lowest order solution into the thermal term which, 
by assumption, is a small correction because w/k >> y/KTeo/Me. This gives the 
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standard form for the high-frequency, electrostatic, unmagnetized plasma wave 


T 
w? = 02, a a, (4.31) 
This most basic of plasma waves is called the electron plasma wave, the Langmuir 


wave (Langmuir 1928), or the Bohm-Gross wave (Bohm and Gross 1949). 
. Case where w/k << V KTeo/Me, VKTio/mi 


Here both electrons and ions are isothermal and the dispersion becomes 


1 
1+5 Pe, = 0. (4.32) 


This has no frequency dependence, and is just the Debye shielding derived in Chapter 
1. Thus, when w/k << J KT.0/Me, v «To /m, the plasma approaches the steady- 
state limit and screens out any applied perturbation. This limit shows why ions cannot 
provide Debye shielding for electrons, because if the test particle were chosen to be 
an electron then its nominal speed would be the electron thermal velocity and from 
the point of view of an ion the test particle motion would constitute a disturbance 
with phase velocity w/k ~ ure which would then violate the assumption w/k << 
KTio / Mi. 


. Case where \/KTj9/m;i << w/k << y/KTeo/Me 


Here the ions act adiabatically whereas the electrons act isothermally so that the dis- 
persion becomes 


1 wi k2 KT; 
ieee E (4.33) 
kA W? w mi 
It is conventional to define the ‘ion acoustic’ velocity 
È Si Nps = KTe/mi (4.34) 
so that Eq.(4.33) can be recast as 
k2 2 k2 T; 
ee a a; (4.35) 
1+krp. w? mi 


Since w/k >> \/KTi9/m;, this may be solved iteratively by first assuming Tip = 
O giving 

we = a , 

14 kbe 

This is the most basic form for the ion acoustic wave dispersion and in the limit 
kd, >> 1, becomes simply w? = c?/ ADe = Ww. To obtain the next higher 
order of precision for the ion acoustic dispersion, Eq.(4.36) may be used to eliminate 
k? /w? from the ion thermal term of Eq.(4.35) giving 


(4.36) 


22 3 
2 kfc _ a2 KLio 
: 


Qi 552 
1+ kd, mi 


(4.37) 
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For self-consistency, it is necessary to have c >> KT;o/m;; if this were not true, 
the ion acoustic wave would become w? = 3k?kT;o/m; which would violate the 
assumption that w/k >> /KTio/mi. The condition c? >> KTi9/m; is the same 
as Te >> T; so ion acoustic waves can only propagate when the electrons are much 
hotter than the ions. This issue will be further explored when ion acoustic waves are 
re-examined from the Vlasov point of view. 


4.2.2 Electromagnetic (incompressible) waves 


The compressional waves discussed in the previous section were obtained by taking the 
divergence of Eq. (4.7). An arbitrary vector field V can always be decomposed into a 
gradient of a potential and a solenoidal part, i.e., it can always be written as V =Vw 
+V x Q where w and Q can be determined from V. The potential gradient Vw has zero 
curl and so describes a conservative field whereas the solenoidal term V x Q has zero 
divergence and describes a non-conservative field. Because Coulomb gauge is being used, 
the —V¢ term on the right hand side of Eq.(4.8) is the only conservative field; the -OA/0t 
term is the solenoidal or non-conservative field. 

Waves involving finite A have coupled electric and magnetic fields and are a generaliza- 
tion of vacuum electromagnetic waves such as light or radio waves. These finite A waves 
are variously called electromagnetic, transverse, or incompressible waves. Since no elec- 
trostatic potential is involved, V - E =0 and the plasma remains neutral. Because A <0, 
these waves involve electric currents. 

Since the electromagnetic waves are solenoidal, the —V¢ term in Eq. (4.7) is superflu- 
ous and can be eliminated by taking the curl of Eq. (4.7) giving 
2v X (MNc Uci) = -qone t, (4.38) 
To obtain an equation involving currents, Eq.(4.38) is integrated with respect to time, mul- 
tiplied by qo /Mo, and then summed over species to give 


V x Jı = —e0w?Bi (4.39) 
where 
w? = D Wee (4.40) 
However, Ampere’s law can be written in the form 
1 OF; 
SV Bi ej (4.41) 
Ho ðt 
which, after substitution into Eq. (4.39), gives 
1 OE, w? 
V x (v x Bı 2 OL ) a Bı. (4.42) 


Using the vector identity V x (V x Q) = V (V - Q)—V°Q and Faraday’s law this be- 
comes 


V7B, = z Wek Ba (4.43) 
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In the limit of no plasma so that we — 0, Eq.(4.43) reduces to the standard vacuum elec- 
tromagnetic wave. If it is assumed that B, ~ exp(ik - x — iwt), Eq.(4.43) becomes the 


electromagnetic, unmagnetized plasma wave dispersion 
w? = we + ke? (4.44) 


Waves satisfying Eq. (4.44) are often used to measure plasma density. Such a measurement 
can be accomplished two ways: 
1. Cutoff method 


If wW? < w? then k? becomes negative, the wave does not propagate, and only expo- 
nentially growing or decaying spatial behavior occurs (such behavior is called evanes- 
cent). If the wave is excited by an antenna driven by a fixed-frequency oscillator, the 
boundary condition that the wave field does not diverge at infinity means that only 
waves that decay away from the antenna exist. Thus, the field is localized near the 
antenna and there is no wave-like behavior. This is called cutoff. When the oscillator 
frequency is raised above wp, the wave starts to propagate so that a receiver located 
some distance away will abruptly start to pick up the wave. By scanning the trans- 
mitter frequency and noting the frequency at which the wave starts to propagate w? is 
determined, giving a direct, unambiguous measurement of the plasma density. 


2. Phase shift method 


Here the oscillator frequency is set to be well above cutoff so that the wave is always 
propagating. The dispersion relation is solved for k and the phase delay Ad of the 
wave through the plasma is measured by interferometric fringe-counting. The total 
phase delay through a length L of plasma is 


L L L 2 
1 1 Ww 

o= | baz = * f [oP ware f h- de (4.45) 
0 c Jo c Jo 2w 


so that the phase delay due to the presence of plasma is 


e2 L 
— Te 2 dx = — | ndz. (4.46) 
Qwemeeo 0 


Thus, measurement of the phase shift Ag due to the presence of plasma can be used to 
measure the average density along L; this density is called the line-averaged density. 


4.3 Low frequency magnetized plasma: Alfvén waves 


4.3.1 Overview of Alfvén waves 


We now consider low frequency waves propagating in a magnetized plasma, i.e. a plasma 
immersed in a uniform, constant magnetic field B = Boz. By low frequency, it is meant 
that the wave frequency w is much smaller than the ion cyclotron frequency wei. Several 
types of waves exist in this frequency range; certain of these involve electric fields having a 
purely electrostatic character (i.e., Vx E = 0), whereas others involve electric fields having 
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an inductive character (i.e., V x E Æ 0). Faraday’s law V x E = —0B/0t shows that 
if the electric field is electrostatic the magnetic field must be constant, whereas inductive 
electric fields must have an associated time-dependent magnetic field. 

We now further restrict attention to a specific category of these w << wei modes. This 
category, called Alfvén waves are the normal modes of MHD, involve magnetic perturba- 
tions and have characteristic velocities of the order of the Alfvén velocity v4 = B/,/figp. 
The existence of such modes is not surprising if one considers that ordinary sound waves 
have a velocity cs = ,/yP/p and the magnetic stress tensor scales as ~ B?/uop so 
that Alfvén-type velocities will result if P is replaced by B?/2j19. Two distinct kinds of 
Alfvén modes exist and to complicate matters these are called a variety of names by dif- 
ferent authors. One mode, variously called the fast mode, the compressional mode, or 
the magnetosonic mode resembles a sound wave and involves compression and rarefac- 
tion of magnetic field lines; this mode has a finite B,,. The other mode, variously called 
the Alfvén mode, the shear mode, the torsional mode, or the slow mode, involves twisting, 
shearing, or plucking motions; this mode has B,, = 0. This latter mode appears in two 
distinct versions when modeled using two-fluid or Vlasov theory depending on the plasma 
b; these are respectively called the inertial Alfvén wave and the kinetic Alfvén wave. 


4.3.2 Zero-pressure MHD model 


In order to understand the basic structure of these modes, the pressure will temporarily 
assumed to be zero so that all MHD forces are magnetic. The fundamental dynamics 
of both MHD modes comes from the polarization drift associated with a time-dependent 
perpendicular electric field, namely 


Mo dE iy 
qo B2? dt’ 
this was discussed in the derivation of Eq.(3.77). The polarization drift results in a polar- 
ization current 


(4.47) 


Ug polarization = 


J, = 5 Noda Ug polarization 
p dE. 
aen Shoe 4.48 
B? dt Oe) 
where p = X` NoMe is the mass density. This can be recast as 
dE, B? 
= SO 
dt Hop 
= vå (V x Bı), (4.49) 
where 5 
B 
v4 = — (4.50) 
ee HoP ; ; 
is the Alfvén velocity. Linearization and combining with Faraday’s law gives the two basic 
coupled equations underlying these modes, 


OF \1 


Ot = vi (V x Bi), 
3B 
BE = -Vx Enr (4.51) 
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The fields and gradient operator can be written as 


E = Ey 
B; = Bit Buz 
ð 
V = @—4V_, (4.52) 
Oz 


since Æ; = 0 in the MHD limit as obtained from the linearized ideal Ohm’s law 


E,+U,xB =0. (4.53) 


The curl operators can be expanded as 


ð 
VxE, = (+v) xE: 
Oz 
OE | 3 
= 27x = +Vı xE]; (4.54) 
Oz 
and 
, oO a 
(V x Bi), = 2ĉ— +V x (Bı + B212) 
Oz 7 
OB | 
= 2x — H tV, Bax? (4.55) 


where it should be noted that both V; x E,, and V, x B|; are in the z direction. 


Slow or Alfvén mode (mode where B,; = 0) TIn this case Bı = B_ and Eqs.(4.51) 
become 


ðE 4 OB 11 
a T Ar oz 
Bua . OF 
J = 2x a (4.56) 
This can be re-written as 
3 OB i141 
J (xE) = v} ay 
OB Ota 
a = ay x En) (4.57) 


which gives a wave equation in the coupled variables 2 x E; and B, ,. Taking a second 
time derivative of the bottom equation and then substituting the top equation gives the wave 
equation for the slow mode (Alfvén mode), 


(4.58) 


This is the mode originally derived by Alfven (1943). 
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4.3.3 Fast mode (mode where B,, 4 0) 


In this case only the z component of Faraday’s law is used and after crossing the top equa- 
tion with Z, Eqs.(4.51) become 


a dB dB. 
gBox? = aà (2x Fer + Vi Ba x2) x 2=08 ( =- vViBa) 


Bız 
ot 


SO x Ej =—V. (El: x 2). (4.59) 


Taking a time derivative of the bottom equation and then substituting for E; ı x Z gives 


Se VB.) . (4.60) 
Oz 
However, using V - Bı = Oit is seen that 


OBA 


V-Bu=- (4.61) 
Oz 
and so the fast wave equation becomes 
Bız Ov -Bii 
ee Fe ( az Vi Ba 
OB, 
aa, ( TA vÈ Ba) 
= vV’ Ba. (4.62) 


4.3.4 Comparison of the two modes 


The slow mode Eq.(4.58) involves z only derivatives and so has a dispersion relation 

w? = kuh (4.63) 
whereas the fast mode involves the V? operator and so has the dispersion relation 

w? = ku?. (4.64) 
The slow mode has B,,; = 0 and so its perturbed magnetic field is entirely orthogonal to 
the equilibrium field. Thus the slow mode magnetic perturbation is entirely perpendicular 
to the equilibrium field and corresponds to a twisting or plucking of the equilibrium field. 


The fast mode has B,, 4 0 which corresponds to a compression of the equilibrium field as 
shown in Fig.4.2. 
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Figure 4.2: Compressional Alfvén waves 


4.3.5 Finite-pressure analysis of MHD waves 


If the pressure is allowed to be finite, then the two modes become coupled and an acoustic 
mode appears. Using the vector identity VB? = 2(B-VB + Bx V x B) the J x B force 
in the MHD equation of motion can be written as 


B? 1 
JxB=-V + —B-VB. (4.65) 
2 Ho Ho 
The MHD equation of motion thus becomes 
DU B? 1 
p =-V (r ) | B-VB. (4.66) 
Dt 2Ho Ho 


Linearizing this equation about a stationary equilibrium where the pressure and the density 
are uniform and constant, gives 


OU; B-B 1 
p— = v (r, ' ) H= Bi VBE (4.67) 
ðt Ho Ho 
The curl of the linearized ideal MHD Ohm’s law, 
E, + U, x B=0, (4.68) 
gives the induction equation 
OB 
—* =V x (U; xB), (4.69) 


ot 
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while the linearized continuity equation 


ə 
CPL aoe (4.70) 
ot 
together with the equation of state 
Pi Pi 
Sy (4.71) 
P p 
give 
OP. 
2 =-yPV U. (4.72) 


To obtain an equation involving U, only, we take the time derivative of Eq.(4.67) and use 
Eqs.(4.69) and (4.72) to eliminate the time derivatives of P; and B,. This gives 
07U, 1 
=, = -V[-yPV-Ui4+—B-Vx(UixB 
Pp (-1 a x (U, x B) 
(4.73) 


+- (B-V)V x (Ui xB). 
0 


This can be simplified using the identity V - (a x b) = b-V x a—a-V x b so that 


B- V x (U1 x B) = V- [(U; x B) x B] = -B?°V - U41. (4.74) 
Furthermore, 
B-V= B? = ik,B. (4.75) 
Using these relations Eq. (4.73) becomes 
U1 _y (2V-U,+v4V- Ui) +ikv4V x (U; x 2). (4.76) 
Ot? 7 


To proceed further we take either the divergence or the curl of this equation to obtain 
expressions for compressional or incompressible motions. 


4.3.6 MHD compressional (fast) mode 


Here we take the divergence of Eq. (4.76) to obtain 


2 
a =V?(2V-U,+v4V- U11) (4.77) 
or 

w? V-U1 = (ki +k?) (2V -Ui + 04V- U1). (4.78) 
On the other hand if Eq.(4.76) is operated on with V1 = V — ik,Z we obtain 


07V-U; . . 
a 1 = V? (2V - U, +04V- U11) + k23- Vx (U x2). (4.79) 
Using 
V x (Ui x Ż) = 2- VU — 2V - U; = ik;-U11 — 2V - U: (4.80) 
Eq.(4.79) becomes 


wV -U= k? (VU +04V- Ui) + kwiv- Ui. (4.81) 
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Equations (4.78) and (4.81) constitute two coupled equations in the variables V - U, 
and V1 - U; į, namely 


(w? — k2) V-U, — kv Vi Uy, = 


k? EV- U + (kv, -aV U = 0. (4.82) 
These coupled equations have the determinant 
(w? — k?) (Kvi — w?) +k wik e= 0 (4.83) 
which can be re-arranged as a fourth order polynomial in w, 
wt — wk? (vå +c?) +k kwh =0 (4.84) 
having roots 
TE k? (v4 +2) + yr! (v2 +2)? - MERI CE dis 


2 
Thus, according to the MHD model, the compressional mode dispersion relation has the 
following limiting forms 


=k (A+) ifk, = 0 (4.86) 
w? Shen 
or ifk? = 0. (4.87) 
uan ee 
4.3.7 MUD shear (slow) mode 
It is now assumed that V - U, = 0 and taking the curl of Eq.(4.76) gives 
a@VxUı 2 OU; ^ 
a = vá V xV x ETA xX Z 
OU OU OU, OU 
= 2 is vee eee 1 2V - 1 Ey A 
= v4Vx a Wee ae 2V De Az we 
zero z~ zero 
2 8? 
= vaga Y x Ui (4.88) 


where the vector identity V x (F x G) = FV - G + G- VE — GV - F — F- VG has been 
used. 

Equation (4.88) reduces to the slow wave dispersion relation Eq.(4.63). The associated 
spatial behavior is such that V x U, Æ 0, and the mode is unaffected by existence of 
finite pressure. The perturbed magnetic field is orthogonal to the equilibrium field, i.e., 
Bı -B = 0, since it has been assumed that V - U; = 0 and since finite Bı -B corresponds 
to finite V - U4. 
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4.3.8 Limitations of the MHD model 


The MHD model ignores parallel electron dynamics and so has a shear mode dispersion 
w? = k2v> that has no dependence on k, . Some authors interpret this as a license to allow 
arbitrarily large kų in which case a shear mode could be localized to a single field line. 
However, the two-fluid model of the shear mode does have a dependence on kı which 
becomes important when either k] €/wpe or kp, become of order unity (whether to use 
C/Wpe Or p, depends on whether Bm,;/m-, is small or large compared to unity). Since 
C/Wpe and p, are typically small lengths, the MHD point of view is acceptable provided 
the characteristic length of perpendicular localization is much larger than c/wpe or p,. 

MHD also predicts a sound wave which is identical to the ordinary hydrodynamic sound 
wave of an unmagnetized gas. The perpendicular behavior of this sound wave is consistent 
with the two-fluid model because both two-fluid and MHD perpendicular motions involve 
compressional behavior associated with having finite B,,. However, the parallel behavior 
of the MHD sound wave is problematical because Æ, is assumed to be identically zero 
in MHD. According to the two-fluid model, any parallel acceleration requires a parallel 
electric field. The two-fluid B,; mode is decoupled from the two-fluid E’,; mode so that 
the two-fluid Bı mode is both compressional and has no parallel motion associated with 
it. 

The MHD analysis makes no restriction on the electron to ion temperature ratio and 
predicts that a sound wave would exist for T, = T;. In contrast, the two-fluid model shows 
that sound waves can only exist when JT. >> T; because only in this regime is it possible 
to have KT; /m; << w?/k2 << KT, /me and so have inertial behavior for ions and kinetic 
behavior for electrons. 

Various paradoxes develop in the MHD treatment of the shear mode but not in the two- 
fluid description. These paradoxes illustrate the limitations of the MHD description of a 
plasma and shows that MHD results must be treated with caution for the shear (slow) mode. 
MHD provides an adequate description of the fast (compressional) mode. 


4.4 Two-fluid model of Alfvén modes 


We now examine these modes from a two-fluid point of view. The two-fluid point of view 
shows that the shear mode occurs as one of two distinct modes, only one of which can exist 
for given plasma parameters. Which of these shear modes occurs depends upon the ratio of 
hydrodynamic pressure to magnetic pressure. This ratio is defined for each species ø as 


nkTo 
bo = BI 

/Lo 
the subscript ø is not used if electrons and ions have the same temperature. 6; measures 
the ratio of ion thermal velocity to the Alfvén velocity since 


(4.89) 


2 : 
vp, — KT; /m; 


= = 6.. 4. 
v4 BP /nmipig Bi CRU 


Thus, vr; << va corresponds to 8; << 1. Magnetic forces dominate hydrodynamic 
forces in a low 8 plasma, whereas in a high 8 plasma the opposite is true. 
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The ratio of electron thermal velocity to Alfvén velocity is also of interest and is 


2 : 
up. _ KIe/Me — mi 


v  B?/nmip Me 


pe (4.91) 
Thus, v7, >> v3 when 8, >> Me/m; and v3,, << v3 when 8, << m,/mj;. Shear 
Alfvén wave physics is different in the 6, >> Me/M; and Be << Me/Mi; regimes which 
therefore must be investigated separately. MHD ignores this 6, dependence, an oversim- 
plification which leads to the paradoxes. 

Both Faraday’s law and the pre-Maxwell Ampere’s law are fundamental to Alfvén wave 
dynamics. The system of linearized equations thus is 


3B 
VxE, = ra (4.92) 
Vx Bı = koJ1- (4.93) 


If the dependence of Jı on E; can be determined, then the combination of Ampere’s 
law and Faraday’s law provides a complete self-consistent description of the coupled fields 
E, B; and hence describes the normal modes. From a mathematical point of view, speci- 
fying J,(E,) means that there are as many equations as dependent variables in the pair of 
Eqs.(4.92),(4.93). The relationship between J; and E; is determined by the Lorentz equa- 
tion or some generalization thereof (e.g., drift equations, Vlasov equation, fluid equation 
of motion). The MHD derivation used the polarization drift to give a relationship between 
Jı, and E,, but leaves ambiguous the relationship between J, | and Ey), 

The two-fluid equations provide a definite description of the relationship between Jj, 
and E;j, At frequencies well below the cyclotron frequency, decoupling of modes also 
occurs in the two-fluid description, and this decoupling is more clearly defined and more 
symmetric than in MHD. The decoupling in a uniform plasma results because the depen- 
dence of Jı on E; has the property that Jj, ~ Fy, and Ji, ~ E,_. Thus, for w << wci 
there is a simple linear relation between parallel electric field and parallel current and an- 
other distinct simple linear relation between perpendicular electric field and perpendicular 
current; these two linear relations mean that the tensor relating Jı to E is diagonal (at 
higher frequencies this is not the case). The decoupling can be seen by supposing that all 
first order quantities have the dependence exp(ik, - x + ik,z) where k) = kê + ky. 
Here k | is the unit vector in the direction of k | and z x k is the binormal unit vector so 
that the set Å| ,2 x k 1, ĉ form a right-handed coordinate system. Mode decoupling can be 
seen by examining the table below which lists the electric and magnetic field components 
in this coordinate system: 


E components B components 


ki- E] ki -Bı 
2 xk- E 2 xk- Bı 
2. E 2- Bı 


Because of the property that Jız ~ Eız and J; ~ E11 the terms in boxes are decoupled 
from the terms not in boxes. Hence, one mode consists solely of interrelationships between 
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the boxed terms (this mode is called the Æ, mode since it has finite Æ,) and the other 
distinct mode consists solely of interrelationships between the unboxed terms (this mode 
is called the B, mode since it has finite B,). Since the modes are decoupled, it is possible 
to “turn off” the Æ, mode when considering the B, mode and vice versa. If the plasma is 
non-uniform, the Æ, and B, modes can become coupled. 

The ideal MHD formalism sidesteps discussion of the Æ, mode. Instead, two discon- 
nected assumptions are invoked in ideal MHD, namely (i) it is assumed that E,; = 0 and 
(ii) the parallel current J,; is assumed to arrange itself spontaneously in such a way as 
to always satisfy V - Jı = 0. This pair of assumptions completes the system of equa- 
tions, but omits the parallel dynamics associated with the Æ, mode and instead replaces 
this dynamics with an assumption that J,; is determined by some unspecified automatic 
feedback mechanism. In contrast, the two-fluid equations describe how particle dynamics 
determines the relationship between J,; and E,,. Thus, while MHD is both simpler and 
self-consistent, it omits some vital physics. 


The two-fluid model is based on the linearized equations of motion 


usi 
ot 
Charge neutrality is assumed so that n; = Nne = n. Also, the pressure term is 


Mon = ngo (Ei + uci X B) — V- Poi. (4.94) 


Pay 0 0 OP sor 
V-Pa=V-]| 0 P ıı 0 = V, Pia tea (4.95) 
0 0 Prat í 


Assuming w << w,; implies w << Wee also and so perpendicular motion can be 
described by drift theory for both ions and electrons. However, here the drift approximation 
is used for the fluid equations, rather than for a single particle. Following the drift method 
of analysis, the left hand side of Eq.(4.94) is neglected to first approximation, resulting in 


usı X B x —Ey, + Vi Po 11/146 (4.96) 
which may be solved for us , ; to give 


Ey xB VP, 11 X B 
B? Nd, B? 

The first term is the single-particle Æ x B drift and the second term is called the 
diamagnetic drift, a fluid effect that does not exist for single-particle motion. Because 
u, 1 1S time-dependent there is also a polarization drift. Recalling that the form of the 
single-particle polarization drift for electric field only is vp = mE, /qB? and using 
E,, — V.Po11/nqe for the fluid model instead of just E; | for single particles (cf. right 
hand side of Eq.(4.96)) the fluid polarization drift is obtained. With the inclusion of this 
higher order correction, the perpendicular fluid motion becomes 


Eı xB VPsi1 xB Mo Mo : 
H E VPs- 4.98 
B2 nap? pB H pB? Eo (4.98) 
The last two terms are smaller than the first two terms by the ratio w/wes and so may 
be ignored when the electron and ion fluid velocities are considered separately. However, 


Usii= (4.97) 


Ug1l1 = 
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when the perpendicular current, i.e., J1} = S>nq,U, 1 is considered, the electron and ion 
E x B drift terms cancel so that the polarization terms become the leading terms involving 
the electric field. Because of the mass in the numerator, the ion polarization drift is much 
larger than the electron polarization drift. Thus, the perpendicular current comes from ion 
polarization drift and diamagnetic current 


Lignm, FE, y VP xB alee by V Pui xB (4 99) 


MoS 11 = B2 - B2 EA El B2 
where Pı = X P11. The term involving P i 1 has been dropped because it is small by 
w/we compared to the P|; term. 


The center of mass perpendicular motion is 


J MgNUg 11 
Emon 
An important issue for the perpendicular motion is whether u, | ı is compressible or incom- 

pressible. Let us temporarily ignore parallel motion and consider the continuity equation 


Ui = Z Uili (4.100) 


2m +nV -usi =Q. (4.101) 
Ot 
If V - u,i1 = 0, the mode does not involve any density perturbation, i.e., nı = 0, and 
is said to be an incompressible mode. On the other hand, if V - u,,, Æ O then there are 
fluctuations in density and the mode is said to be compressible. 
To proceed further, consider the vector identity 


V- (Fx G)=GVxF-F-VxG. 


If G is spatially uniform, this identity reduces to V - (F x G) = G-VxF which in turn 
vanishes if F is the gradient of a scalar (since the curl of a gradient is always zero). Taking 
the divergence of Eq.(4.98) and ignoring the polarization terms (they are of order w/w; 
and are only important when calculating the current which we are not interested in right 
now) gives 
V-Usii = BY x E; = DG x E (4.102) 

to lowest order. Setting E; = —V¢ (i.e., assuming that the electric field is electrostatic) 
would cause the right hand side of Eq.(4.102) to vanish, but such an assumption is overly 
restrictive because all that matters here is the z-component of Vx E,. The z-component 
of V x E; involves only the perpendicular component of the electric field (i.e., only the x 
and y components of the electric field) and so the least restrictive assumption for the right 
hand side of Eq.(4.102) to vanish is to have E1 | = —V _ ¢. Thus, one possibility is to have 
E,, = —V_¢ in which case the perpendicular electric field is electrostatic in nature and 
the mode is incompressible. 

The other possibility is to have 2-VxE, 4 0. In this case, invoking Faraday’s law 
reduces Eq.(4.102) to 


Vale =) 2 


= = (4.103) 
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Combining Eqs.(4.103) and (4.101) and then integrating in time gives 


m Ba 
n B 


(4.104) 


which shows that compression/rarefaction is associated with having finite B,,. 
In summary, there are two general kinds of behavior: 


1. Modes with incompressible behavior; these are the shear modes and have nı = O, 
V-ugii1 = 0, Ey) = —V_¢ and B71 = 0, 


2. Modes with compressible behavior; these are the compressible modes and have nı 4 
0,V- Ug 11 #0, VxEı1 # 0, and B,, #0. 

Equation (4.99) provides a relationship between the perpendicular electric field and the 
perpendicular current. A relationship between the parallel electric field and the parallel 
current is now required. To obtain this, all vectors are decomposed into components par- 
allel and perpendicular to the equilibrium magnetic field, i.e., E1= E1 1 + £12 etc. The 
V operator is similarly decomposed into components parallel to and perpendicular to the 
equilibrium magnetic field, i.e., V = V, + 20/0z and all quantities are assumed to be 
proportional to f(x,y) exp(ik,z — iwt). Thus, Faraday’s law can be written as 


Vi xE., +V x Battin xE;= -2 (Bit Ba?) (4.105) 
which has a parallel component 
Z-Vi x Ei, =iwBy (4.106) 
and a perpendicular component 
(VEz —ikzE11) x Z2=iwBi4. (4.107) 
Similarly Ampere’s law can be decomposed into 
2- Vı x Bai = boda (4.108) 


and 
(V_ Bai —ikzBi1) x ĉ = uJ 11. (4.109) 
Substituting Eq.(4.99) into Eq.(4.109) gives 


i Sias 
A ee) kee ge (4.110) 
vå B 
or, after re-arrangement, 
HoP aS 7 iw 
Vi (Ba ' ) x ĉ—ik-Bj1 x 2 =- Ejı. (4.111) 
B U4 


The slow (shear) and fast (compressional modes) are now considered separately. 
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4.4.1 Two-fluid slow (shear) modes 


As discussed above these modes have B,; = 0, Ei 1 = —V¢,, and V-u,z 11 = 0. We first 
consider the parallel component of the linearized equation of motion, namely 
OUg 21 OF. ol 
Zl = ng, Ez — 

g T ee py 
where Pi = YonoiKTo and y = 1 if the motion is isothermal and y, = 3 if the motion 


is adiabatic and the compression is one-dimensional. The isothermal case corresponds to 
w? /k? << KT, /Mo and vice versa for the adiabatic case. 


NMo 


(4.112) 


The continuity equation is 


o 
= +V- (nus) =0. (4.113) 


Because the shear mode is incompressible in the perpendicular direction, the continuity 


equation reduces to 


ð ð 
CA e E nia) 0. (4.114) 


o z 
Taking the time derivative of Eq.(4.112) gives 
OP tiga KIo 3?uoz1 qo OF 1 


o em Be eee (4.115) 


which is similar to electron plasma wave and ion acoustic wave dynamics except it has not 
been assumed that Æ; is electrostatic. 

Invoking the assumption that all quantities are of the form f(x,y) exp(ik,z — iwt) 
Eq.(4.115) can be solved to give 


iwqo Ez 
sel = 4.116 
ee Mo W? — Yo k?KTo /Mo ( ) 
and so the relation between parallel current and parallel electric field is 
Jn = 2 E, 4.117 
Hog z1 1 2 Wa E F |e ( ) 


Using 2- V x By=V- (Bı x Z) = V- (B11 x Z) the parallel component of Ampere’s 
law becomes for the shear wave 


Vo ‘(Bit xXZ)= 


HE, 4.118 
a k2KTo/Mo ( ) 
Ion acoustic wave physics is embedded in Eq.(4.118) as well as shear Alfvén physics. 
The ion acoustic mode can be retrieved by assuming that the electric field is electrostatic 
in which case B ıı vanishes. For the special case where the electric field is just in the 
z direction, and assuming that KT;/m; << w?/k2 << «T./mez the right hand side of 


Eq.(4.118) becomes 
w, 1 
(= — —,— | Ea =0 (4.119) 
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which gives the ion acoustic wave w? = Re KT. /m, discussed in Sec.4.2.1. This shows that 
the acoustic wave is associated with having finite F',, and also requires T, >> T; in order 
to exist. 

Returning to shear waves, we now assume that the electric field is not electrostatic so 
Bı does not vanish and Eq.(4.118) has to be considered in its entirety. For shear waves 
the character of the parallel current changes depending on whether the wave parallel phase 
velocity is faster or slower than the electron thermal velocity. The w?/k? >> KTe/me 
case is called the inertial limit while the w?/k? << KT, /me case is called the kinetic 
limit. The perpendicular component of Faraday’s law is 


VEz x ĉ-— ik;E1ı x = iwB]ı. (4.120) 


Substitution of E; ı as determined from Eq.(4.111) into Eq.(4.120) gives 


ic VP w? 
Vib kei (= Ll y 2— ikB] x :) x2= By 4121 
vå B U4 
which may be solved for B | ; to give 
1 f 5 sp „2 HoViP i 
Bi = wW? ke, (wien xX Z4 1k,U4 B (4.122) 


Bıı x= 
w2 — k2y2 
w* — k wi 


Substitution of B, 1 x Z into Eq.(4.118) gives 


1 MoViPir — Woo [C 
Vi- Set VLE + kv% 2 = E, 2 : 
a (zl ab OA wB ve 1 Ds aa, KnTo Fi 


(4.124) 
However, because V, - (V1 Pii x 2) = V- (VPL, x Z) = 0 the term involving pressure 
vanishes, leaving an equation involving F’,; only, namely 


> Y1 P 
2o Yes x2). (4.123) 


(wie. bik,vy A 


V eee wee VE E wpol È = 0 4.125 
EA EER AE apaa eee Sa wire) 


This is the fundamental equation for shear waves. On replacing V, — ik,, Eq.(4.125) 
becomes 


2 a2 32. 
ki _ Ype 1 _ Mp 1 


; 3 1 2, z T 2, T. a 
w? — kwi  @ w? — yek?kTe/Me C2 w? — qik?kT;/ m; 


0. (4.126) 


In the situation where w?/k? >> KT./me, the second term dominates the third term 
since we, >> Wri and so Eq.(4.126) can be recast as 


2d 
2 Kv 


w = — 4 _ 
1+ kiewe 


(4.127) 
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which is called the inertial Alfvén wave (IAW). If an ce | wee is not too large, then w/k, 
is of the order of the Alfvén velocity and the condition w? >> k2«T./m, corresponds to 
A >> KTe/Me or 
= By, << n (4.128) 
Thus, inertial Alfvén wave shear modes exist only in the ultra-low 8 regime where 6, << 
Me /Mj. 
In the situation where KT;/m; << w?/k? << KT. /me, Eq.(4.126) can be recast as 


2 2 2 
ki Whe 1 i Wi 1 
( 2 7 . AY 
(w2—k2u4) cc? k2KTe/Me Cc w? 


= 0. (4.129) 


Because w? appears in the respective denominators of two distinct terms, Eq.(4.129) is 
fourth order in w? and so describes two distinct modes. Let us suppose that the mode in 
question is much faster than the acoustic velocity, i.e., w? / k2 >> KTe/m;. In this case the 
ion term can be dropped and the remaining terms can be re-arranged to give 


k? KT. @ 
w? = ko (14 LL); (4.130) 
z“ A 2 2 
V4 Me Wee 
this is called the kinetic Alfvén wave (KAW). 
1 KT. ¢? 1 KT. 
ae age es (4.131) 


Faye w2 w2 ; 
UA Me Whe Wii Mi 


as a fictitious ion Larmor radius calculated using the electron temperature instead of the 
ion temperature, the kinetic Alfvén wave (KAW) dispersion relation can be expressed more 
succinctly as 
w? = kuh (+k? e). (4.132) 

If k? p? is not too large, then w/k, is again of the order of v 4 and so the condition w? << 
k2KTe/Me corresponds to having 3, >> Mej/m;. The condition w?/k? >> KTe/m; 
which was also assumed corresponds to assuming that 8, << 1. Thus, the KAW dispersion 
relation Eq.(4.132) is valid in the regime m./m; << be << 1. 

Let us now consider the situation where w? / k2 << K«KT;/mji, kKTe/Me. In this case 
Eq.(4.126) again reduces to 


w? = k2v% (1 +k? o) (4.133) 
but this time 


2_ K(Te+T:) 
This situation would describe shear modes in a high 8 plasma (ion thermal velocity faster 
than Alfvén velocity). 

To summarize: the shear mode has B,; = 0, Ey, 4 0, Ja # 0, Eji = -Vo, 
and exists in the form of the inertial Alfvén wave for Gb. << Me/m; and in the form 
of the kinetic Alfvén wave for 6e >> Me/mi;. The shear mode involves incompressible 
perpendicular motion, i.e., V- ug. = ik, - ug) 1 = 0, which means that k, is orthogonal 
to u, 1. For example, in Cartesian geometry, this means that if u, | ; is in the x direction, 
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then k, must be in the y direction while in cylindrical geometry, this means that if u, | ; is 
in the 0 direction, then k | must be in the r direction. The inertial Alfvén wave is known as 
a cold plasma wave because its dispersion relation does not depend on temperature (such 
a mode would exist even in the limit of a cold plasma). The kinetic Alfvén wave depends 
on the plasma having finite temperature and is therefore called a warm plasma wave. The 
shear mode can be coupled to ion acoustic modes since both shear and ion acoustic modes 
involve finite F’,,. 


4.4.2 Two-fluid compressional modes 


The compressional mode involves assuming that B,, is finite and that F,; = 0. Having 
F,, = O means that there is no parallel motion and, in particular, implies that J,; = 0. 
Thus, for the compressional mode Faraday’s law has the form 


Vi-(Ei1xZ) = wBa (4.135) 
—ik; -E1 x2 = iwB 1. (4.136) 
Using Eq.(4.136) to substitute for B | 1 in Eq.(4.111) and then solving for E}; gives 
ney) 
o iw oP it 7 
Eli = ney (Ba + rom) XK Z. (4.137) 
Since er 
: iwv HoP 
Eli x ĉ =- V] (B+) (4.138) 
w? — ku, B 
Eq.(4.135) becomes 
v3 MoPi1 
Vi-(= 45 V7. (Ba + OS Bu =0. 4.1 
a (atig: (Pa +54) +80 aa 


If we assume that the perpendicular motion is adiabatic, then 


Pi m% Ba 


A are. (4.140) 
Substitution for P, ; in Eq.(4.139) gives 
2 2 
(v4 +65) 
Vi: (Seven) +Bya= (4.141) 
where PE 
Ê= ap (4.142) 
Mi 


On replacing V 1 — ik_, Eq.(4.141) becomes the dispersion relation 


2 (4,2 2 
ki (v3 +c?) 
Ao 2 
wek tA 


+1=0 (4.143) 


or 

w? = kw +k? ce? (4.144) 
where k? = k2 + k2: Since V-ugi1 = ik, - us11ı Æ 0, the perpendicular wave vector 
kı is at least partially co-aligned with the perpendicular velocity. 
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4.4.3 Differences between the two-fluid and MHD descriptions 


The two-fluid description shows that the slow mode (finite Æ) appears as either an inertial 
or a kinetic Alfvén wave depending on the plasma 8; the MHD description assumes that 
E, = 0 for this mode and does not distinguish between inertial and kinetic modes. The 
two-fluid description also shows that finite Æ, will give ion acoustic modes in the parallel 
direction which are decoupled. The MHD description predicts a so-called sound wave 
which differs from the ion acoustic wave because the MHD sound wave does not have the 
requirement that Te >> 7T;; the MHD sound wave is an artifact for parallel propagation 
in a plasma with low collisionality (if the collisions are sufficiently large, then the plasma 
would behave like a neutral gas). Then MHD description predicts a coupling between 
oblique sound waves via a square root relation (see Eq.(4.85)) which does not exist in the 
two-fluid model. 


4.5 Assignments 


1. Plot frequency versus wavenumber for the electron plasma wave and the ion acoustic 
wave in an unmagnetized Argon plasma which has n = 10! m~?, Te = 10 eV, and 
T; = 1evV. 


2. Let Ad be the difference between the phase shift a Helium-Neon laser beam expe- 
riences on traversing a given length of vacuum and on traversing the same length of 
plasma. What is Ag when the laser beam passes through 10 cm of plasma having a 
density of n = 10?2m*? How could this be used as a density diagnostic? 


3. Prove that the electrostatic plasma wave w? 


as 


= we + 3k?KT./m. can also be written 


w? = w? (1+ 3k? ADe) 
and show over what range of RABe the dispersion is valid. Plot the dispersion w (k) 
versus k for both negative and positive k. Next plot on the same graph the electromag- 
netic dispersion w? = wee +k?c? and show the limits of validity. Plot the ion acoustic 
dispersion w? = k?c?/(1 + Rx on this graph showing its region of validity. Fi- 
nally plot the ion acoustic dispersion with a finite ion temperature. Show the limits of 
validity of the ion acoustic dispersion. 


4. Physical picture of plasma oscillations: Suppose that a plasma is cold and initially 
neutral. Consider a spherical volume of this plasma and imagine that a thin shell 
of electrons at spherical radius r having thickness ôr moves radially outward by a 
distance equal to its thickness. Suppose further that the ions are infinitely massive 
and cannot move. What is the total ion charge acting on the electrons (consider the 
charge density and volume of the ions left behind when the electron shell is moved 
out)? What is the electric field due to these ions. By considering the force due to this 
electric field on an individual electron in the shell, show that the entire electron shell 
will execute simple harmonic motion at the frequency wpe. If the ions had finite mass 
how would you expect the problem to be modified (hint-consider the reduced mass)? 


5. Suppose that an MHD plasma immersed in a uniform magnetic field B = Boz has 
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an oscillating electric field E, where L means in the direction perpendicular to 2. 
What is the polarization current associated with E , ? By substituting this polarization 
current into the MHD approximation of Ampere’s law, find a relationship between 
JE, /Ot and a spatial operator on B. Use Faraday’s law to obtain a similar relationship 
between 0B, /Ot and a spatial operator on E. Consider a mode where F(z, ¢) and 
B y(Z, t) are the only finite components and derive a wave equation. Do the same for 
the pair F(z, t) and B, (z, t). Which mode is the compressional mode and which is 
the shear mode? 


Streaming instabilities and the Landau 
problem 


5.1 Streaming instabilities 
The electrostatic dispersion relation for a zero-temperature plasma is simply 


w2 
1-) 2 =0 (5.1) 


w2 
o 


indicating that a spatially-independent oscillation at the plasma frequency 


Wp = q/ We + w2, (5.2) 


is a normal mode of a cold plasma. Once started, such an oscillation would persist in- 
definitely because no dissipative mechanism exists to quench it. On the other hand, the 
oscillation would have to be initiated by some source, because no available free energy 
exists from which the oscillation could draw to start spontaneously. 

We now consider a slightly different situation where instead of being at rest in equilib- 
rium, cold electrons or ions stream at some spatially-uniform initial velocity. In the special 
situation where electrons and ions have the same initial velocity, the center of mass would 
also move at this initial velocity and one could simply move to the center of mass frame 
where both species are stationary and so, as argued in the previous paragraph, an oscillation 
would not start spontaneously. However, in the more general situation where the electrons 
and ions stream at different velocities, then both species have kinetic energy in the center 
of mass frame. This free energy could drive an instability. 

In order to determine the conditions where such an instability could occur, the situation 
where each species has the equilibrium streaming velocity uso will now be examined. The 
linearized equation of motion, the linearized continuity equation, and Poisson’s equation 
become respectively 


OUs1 do 

ra + Uso © Vusi = ya Oh (5.3) 
Ong 

ae + Uoo: Vnori = —No0V + U1, (5.4) 
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and i 
y? TS olle : 
Qı = 2 Nol. (5.5) 


As before, all first-order dependent variables are assumed to vary as exp(ik - x—iwt). 
Combining the equation of motion and the continuity equation gives 


k? qo 
(w — k - uso)? Mmo 


Noi = Noo Qı : (5.6) 
Substituting this into Eq.(5.5) gives the dispersion relation 
2 


Wp er 
1 2, a. aa = 0 (5.7) 


which is just like the susceptibility for stationary cold species except that here w is replaced 
by the Doppler-shifted frequency wWpoppier = w — k- Ugo. 

Two examples of streaming instability will now be considered: (i) equal densities of 
positrons and electrons streaming past each other with equal and opposite velocities, and 
(ii) electrons streaming past stationary ions. 

Positron-electron streaming instability 

The positron/electron example, while difficult to realize in practice, is worth analyzing 
because it reveals the essential features of the instability with a minimum of mathematical 
complexity. The equilibrium positron and electron densities are assumed equal so as to 
have charge neutrality. Since electrons and positrons have identical mass, the positron 
plasma frequency w,, is the same as the electron plasma frequency wpe. Let ug be the 
electron stream velocity and —uo be the positron stream velocity. Defining z = w/wpe 
and À = k- uo/wpe, Eq. (5.7) reduces to 

1 1 
(z—A)2 ° (z+)?’ 
a quartic equation in z. Because of the symmetry, no odd powers of z appear and Eq.(5.8) 
becomes 


l= 


(5.8) 


zt — 22A? Pax S20 (5.9) 
which may be solved for 2? to give 


z2? = (AX? +1) + v4? +1. (5.10) 


Each choice of the + sign gives two roots for z. If z? > 0 then the two roots are real, equal 
in magnitude, and opposite in sign. On the other hand, if z? < 0, then the two roots are 
pure imaginary, equal in magnitude, and opposite in sign. Recalling that w = wpez and 
that the perturbation varies as exp(ik - x—iwt), it is seen that the positive imaginary root 
z = +i|z| corresponds to instability; i.e., to a perturbation which grows exponentially in 
time. 

Hence the condition for instability is z? < 0. Only the choice of minus sign in Eq.(5.10) 
allows this possibility, so choosing this sign, the condition for instability is 


VAP +1>A?°+1 (5.11) 


C2472. (5.12) 


which corresponds to 
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The maximum growth rate is found by maximizing the right hand side of Eq.(5.10) with 
the minus sign chosen. Taking the derivative with respect to À and setting dz/d\ = 0 to 
find the maximum, gives 


5702 33s oe 


dà VAN? +1 
or \ = \/3/2. Substituting this most unstable \ back into Eq.(5.10) (with the minus sign, 
since this is the potentially unstable root) gives the maximum growth rate to be y = 1/2 
where z = x + ly. 
Changing back to physical variables, it is seen that onset of instability occurs when 


kuo < V2wpe, 


and the maximum growth rate occurs when 


kuo = D Wpe 


in which case 
We 
W=i f 


Figure 5.1 plots the normalized instability growth rate Im z as a function of A; both on- 
set and maximum growth rate are indicated. Since the instability has a pure imaginary 
frequency it is called a purely growing mode. Because the growth rate is of the order of 
magnitude of the plasma frequency, the instability grows extremely quickly. 


L0 
Ymax = 0.5 
0.5 b | 
0 Bis 2 2.0 A 


Figure 5.1: Normalized growth rate v. normalized wavenumber 


Electron-ion streaming instability 
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Now consider the more realistic situation where electrons stream with velocity vo 
through a background of stationary neutralizing ions. The dispersion relation here is 


A 32 
1 a ps =0 1 
w (w—k- w)? Pee 


which can be recast in non-dimensional form by defining z = w/wpe, € = m-/m,, and 
à = k- up/Wpe, giving 
€ 1 
z + Ge (5.14) 
The value of A at which onset of instability occurs can be seen by plotting the right hand 
side of Eq.(5.14) versus z. The first term €/z? diverges at z = 0, while the second term 
diverges at z = A. Between z = O and z = A, the right hand side of Eq.(5.14) has a 
minimum. If the value of the right hand side at this minimum is below unity, there will be 
two places between z = 0 and z = A where the right hand side of Eq.(5.14) equals unity. 
For z > A, there is always one and only one place where the right hand side equals unity 
and similarly for z < O there is one and only one place where the right hand side equals 
unity. If the minimum of the right hand side drops below unity, then Eq.(5.14) has four real 
roots, but if the minimum of the right hand side is above unity there are only two real roots 
(those in the regions z > A and z < 0). In this latter case the other two roots of this quartic 
equation must be complex. 

Because a quartic equation must be expressible in the form 


Hee 


(z — z1)(z — z2)(z — z3)(z — 24) =0 (5.15) 


and because the coefficients of Eq.(5.14) are real, the two complex roots must be complex 
conjugates of each other. To see this, suppose the complex roots are zı and z2 and the real 
roots are z3 and z4. The product of the first two factors in Eq.(5.15) is Ze (21 4+-22)z+21 29; 
if the complex roots are not complex conjugates of each other then this product will contain 
complex coefficients and, when multiplied with the product of the terms involving the real 
roots, will result in an equation that contains complex coefficients. However, Eq.(5.14) has 
only real coefficients so the two complex roots must be complex conjugates of each other. 
The complex root with positive imaginary part will give rise to instability. 

Thus, when the minimum of the right hand side of Eq.(5.14) is greater than unity, two 
of the roots become complex, and one of these complex roots gives instability. The on- 
set of instability occurs when the minimum of the right hand side Eq.(5.14) equals unity. 
Straightforward analysis (cf. assignments) shows this occurs when 


A= (1 + e)’, (5.16) 
i.e., instability starts when 
3/2 


m, \ 1⁄3 
14 (=) | (5.17) 
Mi 


The maximum growth rate of the instability may be found by solving Eq.(5.14) for \ and 
then simplifying the resulting expression using € as a small parameter. The details of this 


k- uo = Wye 
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are worked out in the assignments showing that the maximum growth rate is 


3 . 1/3 
max w; ~ ae (z) Wpe (5.18) 
which occurs when 
k - Uo & Wye. (5.19) 


Again this is a very fast growing instability, about one order of magnitude smaller than the 
electron plasma frequency. 

Streaming instabilities are a reason why certain simple proposed methods for attaining 
thermonuclear fusion will not work. These methods involve shooting an energetic deu- 
terium beam at an oppositely directed energetic tritium beam with the expectation that 
collisions between the two beams would produce fusion reactions. However, such a system 
is extremely unstable with respect to the two-stream instability. This instability typically 
has a growth rate much faster than the fusion reaction rate and so will destroy the beams 
before significant fusion reactions can occur. 


5.2 The Landau problem 


A plasma wave behavior that is both of great philosophical interest and great practical 
importance can now be investigated. Before doing so, three seemingly disconnected results 
obtained thus far should be mentioned, namely: 

1. When the exchange of energy between charged particles and a simple one-dimensional 
wave having dependence ~ exp(ikx — iwt) was considered, the particles were catego- 
rized into two general classes, trapped and untrapped, and it was found that untrapped 
particles tended to be dragged toward the wave phase velocity. Thus, untrapped par- 
ticles moving slower than the wave gain kinetic energy, whereas those moving faster 
lose kinetic energy. This has the consequence that if there are more slow than fast 
particles, the particles gain net kinetic energy overall and this gain presumably comes 
at the expense of the wave. Conversely if there are more fast than slow particles, net 
energy flows from the particles to the wave. 


2. When electrostatic plasma waves in an unmagnetized, uniform, stationary plasma 
were considered it was found that wave behavior is characterized by a dispersion re- 
lation 1+xe(w, k)+x;(w, k) = 0, where x, (w, k) is the susceptibility of each species 
o. These susceptibilities had simple limiting forms when w/k << \/KT,9/m, (isother- 
mal limit) and when w/k >> \/KT50/m,> (adiabatic limit), but the fluid analysis 
failed when w/k ~ \/KT50/m, and the susceptibilities became undefined. 


3. When the behavior of interacting beams of particles was considered, it was found that 
under certain conditions a fast growing instability would develop. 
These three results will be tied together by the analysis of the Landau problem. 


5.2.1 Attempt to solve the linearized Vlasov-Poisson system of equations using 
Fourier analysis 


The method for manipulating fluid equations to find wave solutions was as follows: (i) 
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the relevant fluid equations were linearized, (ii) a perturbation ~ exp(ik- x — iwt) was 
assumed, (ili) the system of partial differential equations was transformed into a system 
of algebraic equations, and then finally (iv) the roots of the determinant of the system of 
algebraic equations provided the dispersion relations which characterized the various wave 
solutions. 

It seems reasonable to use this method again in order to investigate waves from the 
Vlasov point of view. However, it will be seen that this approach fails and that instead, a 
more complicated Laplace transform technique must be used in the Vlasov context. How- 
ever, once the underlying difference between the Laplace and Fourier transform techniques 
has been identified, it is possible to go back and “patch up” the Fourier technique. Al- 
though perhaps not entirely elegant, this patching approach turns out to be a reasonable 
compromise that incorporates both the simplicity of the Fourier method and the correct 
mathematics/physics of the Laplace method. 

The Fourier method will now be presented and, to highlight how this method fails, the 
simplest relevant example will be considered, namely a one dimensional, unmagnetized 
plasma with a stationary Maxwellian equilibrium. The ions are assumed to be so massive 
as to be immobile and the ion density is assumed to equal the electron equilibrium density. 
The electrostatic electric field Æ = —0¢/0z is therefore zero in equilibrium because there 
is charge neutrality in equilibrium. Since ions do not move there is no need to track ion 
dynamics. Thus, all perturbed quantities refer to electrons and so it is redundant to label 
these with a subscript “e”. In order to have a well-defined, physically meaningful problem, 
the equilibrium electron velocity distribution is assumed to be Maxwellian, i.e., 


1 


SiT 5.20 
GET ( i ) 


fo(v) = 
where vp = \/2KT/m. 


The one dimensional, unmagnetized Vlasov equation is 


of of 4 ef o 


21 
Ot Ox = mdOx Ov 2h) 
and linearization of this equation gives 

ofi Of: — ¢ 96, Ofo _ 

at | de mor Ov 9. Ora 


Because the Vlasov equation describes evolution in phase-space, v is an independent vari- 
able just like x and t. Assuming a normal mode dependence ~ exp(ikax — iwt), Eq.(5.22) 
becomes 

q Ofo 


i(w — kv) f, —ikd, =0 (5.23) 
m Ov 
which gives 
= k q Ofo 
Deso amaA (5.24) 


The electron density perturbation is 


ST racine, e 
m=] fidv= taf ey oe (5.25) 
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a relationship between nı and ¢,. Another relationship between nı and ¢, is Poisson’s 
equation 


Poy mig 
Se 5.26 
Ox? E0 (pren 
Replacing 0/0z by ik, Eq.(5.26) becomes 
Kg, = H, (5.27) 
E0 
Combining Eqs.(5.25) and (5.27) gives the dispersion relation 
2 foe) 
q k _ Ofo 
14 dv = 0. 5.28 
k?meo ie (w— kv) Ov ý (9:20) 


This can be written more elegantly by substituting for fo using Eq.(5.20), defining the non- 
dimensional particle velocity = v/vr , and the non-dimensional phase velocity a = 
w/kvr to give 


1 1 a 1 O _22 
-— —> dé———e § =0. 5.29 
Dh T"? J-a ‘Eeo a 
or 
1+x=0 (5.30) 
where the electron susceptibility is 
1 1 S 1 O _2 
= -— > dé——.—e*. 5.31 
DP NG, el? es ‘=a 08° aie 


In contrast to the earlier two-fluid wave analysis where in effect the zeroth, first, and second 
moments of the Vlasov equation were combined (continuity equation, equation of motion, 
and equation of state), here only the Vlasov equation is involved. Thus the Vlasov equa- 
tion contains all the information of the moment equations and more. The Vlasov method 
therefore seems a simpler and more direct way for calculating the susceptibilities than the 
fluid method, except for a serious difficulty: the integral in Eq.(5.31) is mathematically 
ill-defined because the denominator vanishes when € = a (i.e., when w = kur). Be- 
cause it is not clear how to deal with this singularity, the ¢ integral cannot be evaluated and 
the Fourier method fails. This is essentially the same as the problem encountered in fluid 
analysis when w/k became comparable to ,/KT'/m. 


5.2.2 Landau method: Laplace transforms 


Landau (1946) argued that the Fourier problem as presented above is ill-posed and showed 
that the linearized Vlasov-Poisson problem should be treated as an initial value problem, 
rather than as a normal mode problem. The initial value point of view is conceptually re- 
lated to the analysis of single particle motion in sawtooth or sine waves. Before presenting 
the Landau analysis of the linearized Vlasov-Poisson problem, certain important features 
of Laplace transforms will now be reviewed. 

The Laplace transform of a function ¢ (t) is defined as 


Vp) = Í “ple Mat (5.32) 
0 
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and can be considered as a “half of a Fourier transform” since the time integration starts at 
t = O rather than t = —oo. Caution is required regarding the convergence of this integral 
for situations where Y(t) contains exponentially growing terms. 

Suppose such exponentially growing terms exist. As t — oo, the fastest growing term, 
say exp(yt), will dominate all other terms contributing to w(t). The integral in Eq.(5.32) 
will then diverge as t — oo, unless a restriction is imposed on the real part of p. In partic- 
ular, if it is required that Rep > y, then the decaying exp(—pt) factor will always over- 
whelm the growing exp(yt) factor so that the integral in Eq.(5.32) will converge. These 
issues of convergence are ignored in Fourier transforms where it is implicitly assumed that 
the function being transformed has neither exponentially growing terms (which diverge at 
t = oo) nor exponentially decaying terms (which diverge at t = —oo). 

Thus, the integral transform in Eq.(5.32) is defined only for Rep > y. To emphasize 
this restriction, Eq.(5.32) is re-written as 


dy) =f ve Pat, Rep >y (5.33) 
0 


where ¥ is the fastest growing exponential term contained in w(t). Since p is typically 
complex, Eq.(5.33) means that (p) is only defined in that part of the complex p plane 
lying to the right of y as sketched in Fig.5.2(a). Whenever Y(p) is used, one must be 
very careful to avoid venturing outside the region in p—space where v(p) is defined (this 


restriction will later become an important issue). 


To construct an inverse transform, consider the integral 


g(t) = f dp ġ(p)e”. (5.34) 


This integral is ambiguously defined for now because the integration contour C' is unspec- 
ified. However, whatever integration contour is ultimately selected must not venture into 
regions where v(p) is undefined. Thus, an allowed integration path must have Rep > y. 
Substitution of Eq.(5.33) into Eq.(5.34) and interchanging the order of integration gives 


g(t) = f dt’ Í dpy(t)ett, Rep >y. (5.35) 
(0) C 


A useful integration path C for the p integral will now be determined. Recall from the 
theory of Fourier transforms that the Dirac delta function can be expressed as 


6(t) = = J dw e*t (5.36) 


— 00 


which is an integral along the real w axis so that w is always real. The integration path 
for Eq.(5.35) will now be chosen such that the real part of p stays constant, say at a value 
B which is larger than y, while the imaginary part of p goes from —oo to oo. This path is 
shown in Fig.5.2(b), and is called the Bromwich contour. 
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Figure 5.2: Contours in complex p-plane 


For this choice of path, Eq.(5.35) becomes 


o0 B+ioco 
g(t) = fæ J d(pr + ip;) W(t er tip") 

0 B-ioo 
= i fateh y(e) J dp; eipi(t-t’) 

0 — o0 
= ani f Peet) 

0 

= 2ri(t) (5.37) 


where Eq.(5.36) has been used. Thus, y(t) = (27i)~'g(t) and so the inverse of the 
Laplace transform is 


271 aise 


1 B+ico 
v(t) f dpy(pje", B>7. (5.38) 
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Before returning to physics, recall another peculiarity of Laplace transforms, namely 
the transformation procedure for derivatives. The Laplace transform of dy/dt; may be 
simplified by integrating by parts to give 


co dy S co £ 
| dt TEF =[v(t)e”]> +p | dt w(t)e ”* = py(p) — (0). (5.39) 
0 0 


Unlike Fourier transforms, here the initial value forms part of the transform. Thus, Laplace 
transforms contain information about the initial value and so should be better suited than 
Fourier transforms for investigating initial value problems. The importance of initial value 
was also evident in the Chapter 3 analysis of particle motion in sawtooth or sine wave 
potentials. 

The requisite mathematical tools are now in hand for investigating the Vlasov-Poisson 
system and its dependence on initial value. To obtain extra insights with little additional 
effort the analysis is extended to the more general situation of a three dimensional plasma 
where ions are allowed to move. Again electrostatic waves are considered and it is assumed 
that the equilibrium plasma is stationary, spatially uniform, neutral, and unmagnetized. 

The equilibrium velocity distribution of each species is assumed to be a three dimen- 
sional Maxwellian distribution function 


3/2 
Mo 
foo(V) = noo (Ez) exp(—M,v7/2KTs). (5.40) 


The equilibrium electric field is assumed to be zero so that the equilibrium potential is 
a constant chosen to be zero. It is further assumed that at t = O there exists a small 
perturbation of the distribution function and that this perturbation evolves in time so that at 
later times 


fa(x,v,t) = foo(v) + for (x, vt). (5.41) 
The linearized Vlasov equation for each species is therefore 
Ofer o foo 


+v- Vfa a Ve- =0. (5.42) 


ot Ov 


All perturbed quantities are assumed to have the spatial dependence ~ exp(ik - x); this is 
equivalent to Fourier transforming in space. Equation (5.42) becomes 


Ofer : qo : ð foo 
a +ik -vfa A Qik- a 0. (5.43) 
Laplace transforming in time gives 
S ra im r O oO 
(p +ik-v)fo1(v,p) — for(v, 0) — a Ju (p)ik- fao =0 (5.44) 
which may be solved for falvp) to give 
ra 1 Io ~ : foo 
a] JAA p o ,0 ay k- 4 
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This is similar to Eq.(5.24), except that now the Laplace variable p occurs instead of the 
Fourier variable —iw and also the initial value f,;(v,0) appears. As before, Poisson’s 
equation can be written as 


V76, = -— S teat = = Da ufoi(X, v, t). (5.46) 


Replacing V — ik and Laplace transforming with respect to time, Poisson’s equation 
becomes 


k?ġ (p) = / dv foi (vp). (5.47) 


Substitution of Eq.(5.45) into the right hand side of Eq. (5.47) gives 


e~ 7. Ofo 
fa (v0) + 19, (pik “f° 


27 o 
k?6,(p ue [ee EAE (5.48) 


which is similar to Eq.(5.28) except that —iw — p and the initial value appears. Equation 
(5.48) may be solved for 4 (p) to give 


~ N (p) 
p (p) = —= (5.49) 
where the numerator is 
1 foilv 0) 
N = a5 o q? ; . 
(p) ke 2, : J s (p+ik- v) on 
and the denominator is 
R afso 
D(p) =1- L 5 gz [e (5.51) 
k? < como (p+ik- v) j 


Note that the denominator is similar to Eq.(5.28). All that has to be done now is take the 
inverse Laplace transform of Eq.(5.49) to obtain 


eh ff#> NOD) p 
o1(t) = 55 i, @ Dey” (5.52) 


where £ is chosen to be larger than the fastest growing exponential term in N(p)/D(p). 

This is an exact formal solution to the problem. However, because of the complexity of 
N (p) and D(p) it is impossible to evaluate the integral in Eq.(5.52). Nevertheless, it turns 
out to be feasible to evaluate the long-time asymptotic limit of this integral and for practical 
purposes, this is a sufficient answer to the problem. 
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5.2.3 The relationship between poles, exponential functions, and analytic 
continuation 


Before evaluating Eq.(5.52), it is useful to examine the relationship between exponentially 
growing/decaying functions, Laplace transforms, poles, residues, and analytic continua- 
tion. This relationship is demonstrated by considering the exponential function 


f(t) = et (5.53) 


where q is a complex constant. If the real part of q is positive, then the amplitude of f(t) 
is exponentially growing, whereas if the real part of q is negative, the amplitude of f(t) is 
exponentially decaying. Now, calculate the Laplace transform of f (t); it is 


T 99 1 
f(p) = f et P)tdt = ——, defined only for Rep > Req. (5.54) 
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Let us examine the Bromwich contour integral for fi (p) and temporarily call this integral 

F(t); evaluation of F(t) ought to yield F(t) = f(t). Thus, we define 
1 B+ioco a 

F®)=55 dpf (p), 8 > Reg. (5.55) 
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If the Bromwich contour could be closed in the left hand p plane, the integral could easily 

be evaluated using the method of residues but closure of the contour to the left is forbidden 

because of the restriction that 8 > Re q. This annoyance may be overcome by constructing 

a new function f(p) which 


1. equals F(p) in the region 6 > Req, 
2. is also defined in the region 6 < Req, and 


3. is analytic. 
Integration of f (p)along the Bromwich contour gives the same result as does inte- 
gration of f (p) along the same contour because the two functions are identical along this 
contour [cf. stipulation (1) above]. Thus, it is seen that 


1 B+ico 


dpf (p)e’, (5.56) 
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y= 2mi Jg 
but now there is no restriction on which part of the p plane may be used. So long as the 
end points are kept fixed and no poles are crossed, the path of integration of an analytic 
function can be arbitrarily deformed. This is because the difference between the original 
path and a deformed path is a closed contour which integrates to zero if it does not enclose 
any poles. Because f (p) — Oat the endpoints 8 + oo, the integration path of f (p) can be 
deformed into the left hand plane as long as f (p) remains analytic (i.e., does not jump over 
any poles or branch cuts). How can this magic function f (p) be constructed? 

The answer is simple; we define a function f. (p) having the identical functional form as 
if (p), but without the restriction that Rep > Re q. Thus, the analytic continuation of 


1 
f(p) = =a defined only for Rep > Req (5.57) 
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is simply 


1 7 
f(p) = eT defined for all p provided f(p) remains analytic. (5.58) 


The Bromwich contour can now be deformed into the left hand plane as shown in Fig. 
5.3. Because exp(pt) — 0 for positive t and negative Re p, the integration contour can be 
closed by an arc that goes to the left (cf. Fig.5.3) into the region where Re p — —oo. The 
resulting contour encircles the pole at p = q and so the integral can be evaluated using the 
method of residues as follows: 


1 1 1 
F(t) = — e”tdp = lim 2ri(p — ——— e| = et, 5.59 
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Figure 5.3: Bromwich contour 


This simple example shows that while the Bromwich contour formally gives the inverse 
Laplace inverse transform of f (p), the Bromwich contour by itself does not allow use of 
the method of residues, since the poles of interest are located precisely in the left hand 
complex p plane where f (p) is undefined. However, analytic continuation of f (p) allows 
deformation of the Bromwich contour into the formerly forbidden area, and then the inverse 
transform may be easily evaluated using the method of residues. 
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5.2.4 Asymptotic long time behavior of the potential oscillation 


We now return to the more daunting problem of evaluating Eq.(5.52). As in the simple 
example above, the goal is to close the contour to the left, but because the functions N (p) 
and D(p) are not defined for Re p < y, this is not immediately possible. It is first necessary 
to construct analytic continuations of N(p) and D(p) that extend the definition of these 
functions into regions of negative Rep. As in the simple example, the desired analytic 
continuations may be constructed by taking the same formal expressions as obtained before, 
but now extending the definition to the entire p plane with the proviso that the functions 
remain analytic as the region of definition is pushed leftwards in the p plane. 

Consider first construction of an analytic continuation for the function N (p). This func- 
tion can be written as 


F51(v4,0) a 
N(p) = ga lef aT Eee a dv) ip |B) (5.60) 


Here, || means in the k direction, and the parallel component of the initial value of the 
perturbed distribution function has been defined as 


Fo1(vy,0) = ferao. (5.61) 


The integrand in Eq.(5.50) has a pole at v} = ip/k. Let us assume that k > O (the 
general case where k can be of either sign will be left as an assignment). Before we 
construct an analytic continuation, Re p is restricted to be greater than y so that the pole 
vy = ip/kis in the upper half of the complex vj plane as shown in Fig.5.4(a). When N (p) 
is analytically continued to the left hand region, the definition of N (p) is extended to allow 
Rep to become less than y and even negative. As shown in Figs. 5.4(b), decreasing Re p 
means that the pole at vy = ip/k in Eq.(5.50) drops from its initial location in the upper 
half v) plane toward the lower half v) plane. A critical question now arises: how should 
we arrange this construction when Rep passes through zero? If the pole is allowed to 
jump from being above the path of vj integration (which is along the real vj axis) to being 
below, the function N (p) will not be analytic because it will have a discontinuous jump of 
2ri times the residue associated with the pole. Since it was stipulated that N (p) must be 
analytic, the pole cannot be allowed to jump over the vy contour of integration. Instead, 
the prescription proposed by Landau will be used which is to deform the v contour as Re p 
becomes negative so that the contour always lies below the pole; this deformation is shown 
in Figs.5.4(c). 
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Figure 5.4: Complex v plane 


D(p) involves a similar integration along the real vj axis. It also has a pole that is 
initially in the upper half plane when Rep > 0, but then drops to being below the axis as 
Re pis allowed to become negative. Thus analytic continuation of D(p) is also constructed 
by deforming the path of the v) integration so that the contour always lies below the pole. 

Equipped with these suitably constructed analytic continuations of N(p) and D(p) 
into the left-hand p plane, evaluation of Eq.(5.52) can now be undertaken. As shown in 
the simple example, it is computationally advantageous to deform the Bromwich contour 
into the left hand p-plane. The deformed contour evaluates to the same result as the orig- 
inal Bromwich contour (provided the deformation does not jump over any poles) and this 
evaluation may be accomplished via the method of residues. In the general case where 
N(p)/D(p) has several poles in the left hand p plane, then as shown in Fig.5.2(c), the 
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contour may be deformed so that the vertical portion is pushed to the far left, except 
where there is a pole p;; the contour “snags” around each pole p; as shown in Fig.5.2(c). 
For Rep — —oo, the numerator N(p) — 0, while the denominator D(p) — 1. Since 
exp(pt) — 0 for Re p — —co and positive t, the left hand vertical line does not contribute 
to the integral and Eq.(5.52) simply consists of the sum of the residues of all the poles, 
i.e. 


. N(p) 
di (t) = a je -2) Bo) e| (5.62) 
Where do the poles p; come from? Upon examining Eq.(5.62), it is clear that poles could 
come either from (i) N (p) having an explicit pole, i.e. N (p) contains a term ~ 1/(p—p,), 
or (ii) from D(p) containing a factor ~ (p — p;), i.e., pj is a root of the equation D(p) = 
0. The integrand in Eq. (5.60) has a pole in the vj plane; this pole is “used up” as a 
residue upon performing the vy integration, and so does not contribute a pole to N (p). 
The only other possibility is that the initial value F’,(v\,0) somehow provides a pole, but 
F,1(v),0) is a physical quantity with a bounded integral [i.e., f F51(vy,0)dv, is finite] and 
so cannot contribute a pole in N (p). It is therefore concluded that all poles in N(p)/D(p) 
must come from the roots (also called zeros) of D(p). 

The problem can be simplified by deciding to be content with a less than complete so- 
lution. Instead of attempting to calculate ¢,(¢) for all positive times (i.e., all the poles 
pj contribute to the solution), we restrict ourselves to the less burdensome problem of find- 
ing the long time asymptotic behavior of ¢, (t). Because each term in Eq.(5.62) has a factor 
exp(ip;t), the least damped term [i.e., the term with pole furthest to the right in Fig.5.2(c)], 
will dominate all the other terms at large t. Hence, in order to find the long-term asymptotic 
behavior, all that is required is to find the root p; having the largest real part. 

The problem is thus reduced to finding the roots of D(p); this requires performing the 
vj integration sketched in Fig.5.4. Before doing this, it is convenient to integrate out the 
perpendicular velocity dependence from D(p) so that 


2 i fso 
1 ; 
D(p) Zi - qo dv - ðv 
k? L Eome (p +ik-v) 
OF 50 
1 lA Ov| 
= 1 = g 5.63 
k? 4 Eome S "Noj = ip/k) mee 
Thus, the relation D(p) = 0 can be written in terms of susceptibilities as 
Dip) =1+Xi+Xe =0 (5.64) 


since the quantities being summed in Eq.(5.63) are essentially the electron and ion pertur- 
bations associated with the oscillation, and D(p) is the Laplace transform analog of the 
the Fourier transform of Poisson’s equation. In the special case where the equilibrium dis- 
tribution function is Maxwellian, the susceptibilities can be written in a standardized form 
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as 
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k2r2,, | whl? (€ — a) 
1 
= sy ll+aZ(a)] (5.65) 
Do 


where œ = ip/kvro, and the last line introduces the plasma dispersion function Z (a) 
defined as 


g 2 
jja J Tea (5.66) 


where the € integration path is under the dropped pole. 


5.2.5 Evaluation of the plasma dispersion function 


If the pole corresponding to the fastest growing (i.e., least damped) mode turns out to have 
dropped well below the real axis (corresponding to Re p being large and negative), the 
fastest growing mode would be highly damped. We argue that this does not happen be- 
cause there ought to be a correspondence between the Vlasov and fluid models in regimes 
where both are valid. Since the fluid model indicated the existence of undamped plasma 
waves when w/k was much larger than the thermal velocity, the Vlasov model should pre- 
dict nearly the same wave in this regime. The fluid wave model had no damping and 
so any damping introduced by the Vlasov model should be weak in order to maintain an 
approximate correspondence between fluid and Vlasov models. The Vlasov solution cor- 
responding to the fluid mode can therefore have a pole only slightly below the real axis, 
i.e., only slightly negative. In this case, it is only necessary to analytically continue the de- 
finition of N(p)/D(p) slightly into the negative p plane. Thus, the pole in Eq.(5.66) drops 
only slightly below the real axis as shown in Fig.5.5. 

The € integration contour can therefore be divided into three portions, namely (1) from 
E =—oo to £ = a — 6, just to the left of the pole; (ii) a counterclockwise semicircle of 
radius 6 half way around and under the pole [cf. Fig.5.5]; and (iii) a straight line from 
a+6 to +00. The sum of the straight line segments (i) and (iii) in the limit 6 — 0 is called 
the principle part of the integral and is denoted by a ‘P’ in front of the integral sign. The 
semicircle portion is half a residue and so makes a contribution that is just mi times the 
residue (rather than the standard 27i for a complete residue). Hence, the plasma dispersion 
function for a pole slightly below the real axis is 
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CO 


Z(a) = <5 p f ope? + in"? exp(—a”) (5.67) 


where P means principle part of the integral. Equation (5.67) prescribes how to evaluate 
ill-defined integrals of the type we first noted in Eq.(5.28). 


Fre 


complex ğ plane 


integration contour 


Figure 5.5: Contour for evaluating plasma dispersion function 


There are two important limiting situations for Z (œ), namely |a| >> 1 (correspond- 
ing to the adiabatic fluid limit since w/k >> vro) and |a| << 1 (corresponding to the 
isothermal fluid limit since w/k << vre). Asymptotic evaluations of Z (œ) are possible in 
both cases and are found as follows: 

1. œ >> 1 case. 


Here, it is noted that the factor exp(—€”) contributes significantly to the integral 
only when £ is of order unity or smaller. In the important part of the integral where 
this exponential term is finite, |a| >> &. In this region of £ the other factor in the 
integrand can be expanded as 


ae-b(t-8) "tf 
(E-a) a a a| a (a) a) a] U| 
(5.68) 
The expansion is carried to fourth order because of numerous cancellations that elim- 
inate several of the lower order terms. Substitution of Eq.(5.68) into the integral in 


Eq.(5.67) and noting that all odd terms in Eq.(5.68) do not contribute to the integral 
because the rest of the integrand is even gives 


1 7 exp(—€7) _ 1 1 7 5 EN? EN 
Pap fa (E-a) ~~ ant? J eol al: t (£) } (£) Fad. 


— co 
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The ‘P’ has been dropped from the right hand side of Eq.(5.69) because there is no 
longer any problem with a singularity. These Gaussian-type integrals may be evalu- 
ated by taking successive derivatives with respect to a of the Gaussian 


1 1 
and then setting a = 1. Thus, 


1 1 3 
aa [se Eya o qi? fat oe) =i (5.71) 


so Eq.(5.69) becomes 
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In summary, for |a| >> 1, the plasma dispersion function has the asymptotic form 
l | 1 | 3 | | 3771/2 2 
Z(a) = —- h So ae " + in!/? exp(—a?). (5.73) 


ja| << 1 case. 


In order to evaluate the principle part integral in this regime the variable n = € — ais 
introduced so that dn = d&. The integral may be evaluated as follows: 
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(5.74) 
where in the third line all odd terms from the second line integrated to zero due to 
their symmetry. Thus, for œ << 1, the plasma dispersion function has the asymptotic 
limit 


2 
Z(a) = —2a (1 ee ka) hirt? exp(—a?). (5.75) 
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5.2.6 Landau damping of electron plasma waves 
The plasma susceptibilities given by Eq.(5.65) can now be evaluated. For |a| >> 1, using 


Eq.(5.73), and introducing the “frequency” w = ip so that a = w/kvro and a; = w; /kuTo 
the susceptibility is seen to be 


1 l | 1 ( 1 3 ) i 2 
Xe = = l+a 14 + —— +... ] +in!/? exp(—a?) 
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Do 
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(5.76) 
Thus, if the root is such that |a| >> 1, the equation for the poles D(p) = 1 +X; + Xe = 0 
becomes 


Wee k? KTe a g2 

1 P (1 + 3 a} H i rS, exp(—w? /k? v2...) 
w2; k2 KT; w ml 

p (1 | a) H T Is, exp(—w?/k?u2,,) =0. (5.77) 
This expression is similar to the previously obtained fluid dispersion relation, Eq. (4.31), 
but contains additional imaginary terms that did not exist in the fluid dispersion. Further- 
more, Eq.(5.77) is not actually a dispersion relation. Instead, it is to be understood as the 
equation for the roots of D(p). These roots determine the poles in N(p)/D(p) producing 
the least damped oscillations resulting from some prescribed initial perturbation of the dis- 
tribution function. Since Ww, / Ww? = Mi /Me and in general vp; << vre, both the real and 
imaginary parts of the ion terms are much smaller than the corresponding electron terms. 
On dropping the ion terms, the expression becomes 


we, k? KT. wml? 
1 P (1 3 aa a) l ee I>, exp(—w?/k?u2,,) = 0. (5.78) 
Recalling that w = ip is complex, we write w = wrp + iw; and then proceed to find the 
complex w that is the root of Eq.(5.78). Although it would not be particularly difficult to 
simply substitute w = wr + iw; into Eq.(5.78) and then manipulate the coupled real and 
imaginary parts of this equation to solve for w, and w;, it is better to take this analysis as 
an opportunity to introduce a more general way for solving equations of this sort. 
Equation (5.78) can be written as 


D(wr + iwi) = Dr (wr + iw;) + iD; (w, +iw;) =0 (5.79) 


where D, is the part of D that does not explicitly contain i and D; is the part that does 
explicitly contain i. Thus 


w? k? KT, w n! 
Z pe i cae a sg 27,2 
D, =1 72 (1 H 32 ae a) a aD kore FXG, exp(—w"/k"uz,). (5.80) 
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Since the oscillation has been assumed to be weakly damped, w; << w, and so Eq.(5.79) 
can be Taylor expanded in the small quantity w;, 


. dD, $ ; dD; E 
D, (wr) + iw; ( T J Hi [oe +iwi ( a Jal =0. (5.81) 


Since w; << wy, the real part of Eq.(5.81) is 


D, (wr) = 0. (5.82) 

Balancing the two imaginary terms in Eq.(5.81) gives 

Di (wr) 
wi = —-Jp, (5.83) 
dw 
Thus, Eqs.(5.82) and (5.80) give the real part of the frequency as 
k? KT, , 
eer e\ 2 252 

We = We (1 + a ) ~ We (1 + 3k7Az,) (5.84) 


while Eqs.(5.83) and (5.80) give the imaginary part of the frequency which is called the 
Landau damping as 


W= i = ae exp ( w?/k?v2.,) 
5 Ade (5.85) 
e l 2 232 
= 3 Br, exp |- (1+ 3k?Ap,) /2k? Ape] - 
Since the least damped oscillation goes as exp(pt) = exp(—iwt) = — exp(—i(w, + 


iw;)t) = exp(—iw,t +w,t) and Eq.(5.85) gives a negative w; , this is indeed a damping. It 
is interesting to note that while Landau damping was proposed theoretically by Landau in 
1949, it took sixteen years before Landau damping was verified experimentally (Malmberg 
and Wharton 1964). 

What is meant by weak damping v. strong damping? In order to calculate w; it was 
assumed that w; is small compared to w, suggesting perhaps that w; is unimportant. How- 
ever, even though small, w; can be important, because the factor 27 affects the real and 
imaginary parts of the wave phase differently. Suppose for example that the imaginary part 
of the frequency is 1/27 ~ 1/6 the magnitude of the real part. This ratio is surely small 
enough to justify the Taylor expansion used in Eq.(5.81) and also to justify the assumption 
that the pole p; corresponding to this mode is only slightly to the left of the imaginary p 
axis. Let us calculate how much the wave is attenuated in one period T = 27/w,. This 
attenuation will be exp(—|w;|7) = exp(—27/6) ~ exp(—1) ~ 0.3. Thus, the wave ampli- 
tude decays to one third its original value in just one period, which is certainly important. 


5.2.7 Power relationships 


It is premature to calculate the power associated with wave damping, because we do not yet 
know how to add up all the energy in the wave. Nevertheless, if we are willing to assume 
temporarily that the wave energy is entirely in the wave electric field (it turns out there is 
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also energy in coherent particle motion - to be discussed in Chapter 14), it is seen that the 
power being lost from the wave electric field is 


d E? d F? ileo E2 
Pca éiast AS £0 wave Ky Eo] adë exp ( 2|w;|t) so jw [Eo wave 
dt 2 dt 4 2 
— {i _ wpe 2 p22 2 
g 2K3AS exp (—w /k Te) EOL vave 
(5.86) 
where (E2ave) = |Ewave|” (cos(ka — wt)) = |Ewave|”/2 has been used. However, in 
Sec.3.8, it was shown that the energy gained by untrapped resonant particles in a wave is 
2 
TMW qEwave d 
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(5.87) 


using w ~ Wye this is seen to be the same as Eq.(5.86) except for a factor of two. We shall 
see later that this factor of two comes from the fact that the wave electric field actually 
contains half the energy of the electron plasma wave, with the other half in coherent particle 
motion, so the true power loss rate is really twice that given in Eq.(5.86). 


5.2.8 Landau damping for ion acoustic waves 


Ion acoustic waves resulted from a two-fluid analysis in the regime where the wave phase 
velocity was intermediate between the electron and ion thermal velocities. In this situation 
the electrons behave isothermally and the ions behave adiabatically. This suggests there 
might be another root of D(p) if |ae| << 1 and |a;| >> 1 or equivalently vr; << 
w/k << vre. From Eqs.(5.65) and (5.75), the susceptibility for ja] << 1 is found to be 
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Using Eq.(5.88) for the electron susceptibility and Eq.(5.76) for the ion susceptibility 
gives 
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On applying the Taylor expansion technique discussed in conjunction with Eqs.(5.82) and 
(5.83) we find that w, is the root of 
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Here, as in the two-fluid analysis of ion acoustic waves, c = Ww ADe = KTe /m; has been 
defined. The imaginary part of the frequency is found to be 
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(5.92) 
The dominant Landau damping comes from the ions, since the electron Landau damping 
term has the small factor \/m,./m;. If Te >> T; the ion term also becomes small because 
x?/2 exp(—x) — 0 as z becomes large. Hence, strong ion Landau damping occurs when 
T; approaches Te and so ion acoustic waves can only propagate without extreme attenu- 
ation if the plasma has T. >> T;. Landau damping of ion acoustic waves was observed 
experimentally by Wong, Motley and D’ Angelo (1964). 


5.2.9 The Plemelj formula 


The Landau method showed that the correct way to analyze problems that lead to ill-defined 
integrals such as Eq.(5.31) is to pose the problem as an initial value problem rather than as 
a steady-state situation. The essential result of the Landau method can be summarized by 
the Plemelj formula 


. 1 
eagle) Gee 
which is a prescription showing how to deal with singular integrands of the form appearing 
in the plasma dispersion function. From now on, instead of repeating the lengthy Laplace 
transform analysis, we instead will use the less cumbersome, but formally incorrect Fourier 
method and then invoke Eq.(5.93) as a ‘patch’ to resolve any ambiguities regarding inte- 
gration contours. 


+ ind(€ — a) (5.93) 


172 Chapter 5. Streaming instabilities and the Landau problem 


5.3 The Penrose criterion 


The analysis so far showed that electrostatic plasma waves are subject to Landau damping, 
a collisionless attenuation proportional to [ð f/v], u /k> and that this damping is con- 
sistent with the calculation of power input to particles by an electrostatic wave. Since a 
Maxwellian distribution function has a negative slope, its associated Landau damping is al- 
ways a true wave damping. This is consistent with the physical picture developed in the 
single particle analysis which showed that energy is transferred from wave to particles if 
there are more slow than fast particles in the vicinity of the wave phase velocity. What 
happens if there is a non-Maxwellian distribution function, in particular one where there 
are more fast particles than slow particles in the vicinity of the wave phase velocity, i.e., 
[Of /O0] -w/k > 0? Because f(v) — 0 as v — ov, f can only have a positive slope for 
a finite range of velocity; i.e., positive slopes of the distribution function must always be 
located to the left of a localized maximum in f(v). A localized maximum in f(v) corre- 
sponds to a beam of fast particles superimposed on a (possible) background of particles 
having a monotonically decreasing f(v). Can the Landau damping process be run in re- 
verse and so provide Landau growth, i.e., wave instability? The answer is yes. We will 
now discuss a criterion due to Penrose (1960) that shows how strong a beam must be to 
give Landau instability. 
The procedure used to derive Eq.(5.28) is repeated, giving 


2 foe) 
q k Of 
14 dv =0 5.94 
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which may be recast as 
kK? = Q(z) (5.95) 
where 4 af 
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is a complex function of the complex variable z = w/k. The wavenumber k is assumed to 
be a positive real quantity and the Plemelj formula will be used to resolve the ambiguity 
due to the singularity in the integrand. 

The left hand side of Eq.(5.95) is, by assumption, always real and positive for any choice 
of k. A solution of this equation can therefore always be found if Q(z) is simultaneously 
pure real and positive. The actual magnitude of Q(z) does not matter, since the magnitude 
of k? can be adjusted to match the magnitude of Q(z). 

The function Q(z) may be interpreted as a mapping from the complex z plane to the 
complex Q plane. Because solutions of Eq.(5.95) giving instability are those for which 
Imw > 0, the upper half of the complex z plane corresponds to instability and the real z 
axis represents the dividing line between stability and instability. Let us consider a straight- 
line contour C, parallel to the real z axis, and slightly above. As shown in Fig.5.6(a) this 
contour can be prescribed as z = z, + iô where 6 is a small constant and z, ranges ranges 
from —oo to +00. 

The function Q(z) — 0 when z — o0 and so, as z is moved along the C, contour, 
the corresponding path Cg traced in the Q plane must start at the origin and end at the 
origin. Furthermore, since Q can be evaluated using the Plemelj formula, it is seen that 
Q is finite for all z on the path C’,. Thus, Cg is a continuous finite curve starting at the 
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Q-plane origin and ending at this same origin as shown by the various possible mappings 


sketched in Figs.5.6(b), (c) and (d). 
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Figure 5.6: Penrose criterion: (a)-(d) mappings, (e) instability criterion. 


The upper half z plane maps to the area inside the curve Co. If Cg is of the form 


shown in Fig.5.6(b), then Q(z) never takes on a positive real value for z being in the upper 
half z plane; thus a curve of this form cannot give a solution to Eq.(5.95) corresponding 
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to an instability. However, curves of the form sketched in Figs.5.6 (c) and (d) do have 
Q(z) taking on positive real values and so do correspond to unstable solutions. Marginally 
unstable situations correspond to where Co crosses the positive real Q axis, since Cg is a 
mapping of C, which was the set of marginally unstable frequencies. 

Let us therefore focus attention on what happens when Cg crosses the positive real Q 
axis. Using the Plemelj formula on Eq.(5.96) it is seen that 


A o 
ImQ = I r [22| 
ðv v=w,/k 


MEQ 
and, on moving along Cg from a point just below the real Q axis to just above the real 
Q axis, IMQ goes from being negative to positive. Thus, [3 fo/ Wy =u, jx Changes from 
being negative to positive, so that on the positive real Q axis fo is a minimum at some value 
V = Vmin (here the subscript “min” means the value of v for which fo is at a minimum and 
not v itself is at a minimum). A Taylor expansion about this minimum gives 


(5.97) 
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Since f (Vmin) is a constant, it is permissible to write 
Ofo 8 
ðv T av [fo(v) fo(Umin) | - (5.99) 


This innocuous insertion of f (Umin) makes it easy to integrate Eq. (5.96) by parts 


2 y [f(v) E f(Umin)] a 
Q(z = min) = pf aw Ov Cana HiT Fa) o 
2 oo 
= ae a (v — = [f(v) f(vmin)] 
pE 4° i i (v — Umin) i 
7 nal (U — Umin) fo ! 5S a l; 
(5.100) 


in the second line advantage has been taken of the fact that the imaginary part is zero by 
assumption, and in the third line the ‘P’ for principle part has been dropped because there 
is no longer a singularity at v = Vmin. In fact, since the leading term of f(v) — f (Vmin) is 
proportional to (v — Vmin )?, this qualifying ‘P’ can also be dropped from the second line. 
The requirement for marginal instability can be summarized as: f(v) has a minimum at 
UV = Vmin, and the value of Q is positive, i.e., 


Q(Umin) = as ne dv E >0. (5.101) 


MEQ 


— o0 
This is just a weighted measure of the strength of the bump in f located to the right of the 
minimum as shown in Fig.5.6(e). The hatched areas with horizontal lines make positive 
contributions to Q, while the hatched areas with vertical lines make negative contributions. 
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These contributions are weighted according to how far they are from Vmin by the factor 
(v = Üa) 

The Penrose criterion extends the 2-stream instability analysis to an arbitrary distribu- 
tion function containing finite temperature beams. 


5.4 Assignments 


1. Show that the electrostatic dispersion relation for electrons streaming through ions 
with velocity vo through stationary ions is 


1,2 2 
l Wpi We 


w? (w—k-vo)? — 


(a) Show that instability begins when 


(b) Split the frequency into its real and imaginary parts so that w = wp + iw;. Show 
that the instability has maximum growth rate 


wi vV3Bfm\ 
We 2 (35) i 

What is the value of kvo/wpe when the instability has maximum growth rate. 
Sketch the dependence of w;/wpe on kvo/wpe.(Hint- define non-dimensional 
variables € = Me/Mi, Z = W/Wpe, and AX = kup /Wye. Let z = x + iy and 
look for the maximum y satisfying the dispersion. A particularly neat way to 
solve the dispersion is to solve the dispersion for the imaginary part of A which 
of course is zero, since by assumption k is real. Take advantage of the fact that 
€ << 1 to find a relatively simple expression involving y. Maximize y with 
respect to x and then find the respective values of x,y, and A at this point of 
maximum y. 


2. Prove the Plemelj formula. 
3. Suppose that 
E(x,t) = J E(k)e* ie tak (5.102) 
where 
w = w,(k) + iw;(k) 
is determined by an appropriate dispersion relation. Assuming that E(x,t) is a real 
quantity, show by comparing Eq.(5.102) to its complex conjugate, that w,.(k) must 
always be an odd function of k while w; must always be an even function of k. 


(Hint- Note that the left hand side of Eq.(5.102) is real by assumption, and so the right 
hand side must also be real. Take the complex conjugate of both sides and replace 
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the dummy variable of integration k by —k so that dk — —dk and the +co limits of 
integration are also interchanged). 


. Plot the real and imaginary parts of the plasma dispersion function. Plot the real and 


imaginary parts of the susceptibilities. 


. Is it possible to have electrostatic plasma waves with kApe >> 1. Hint, consider 


Landau damping. 


. Plot the potential versus time in units of the real period of an electron plasma wave for 


various values of w/k4/KTe/Me showing the onset of Landau damping. 


. Plot w;/w, for ion acoustic waves for various values of T./T; and show that these 


waves have strong Landau damping when the ion temperature approaches the electron 
temperature. 


. Landau instability for ion acoustic waves- Plasmas with T, > T; support propagation 


of ion acoustic waves; these waves are Landau damped by both electrons and ions. 
However, if there is a sufficiently strong current J flowing in the plasma giving a 
relative streaming velocity uo = J/ne between the ions and electrons, the Landau 
damping can operate in reverse, and give a Landau growth. This can be seen by 
moving to the ion frame in which case the electrons appear as an offset Gaussian. If 
the offset is large enough it will be possible to have [0f./ OU] yn, > 0, giving more 
fast than slow particles at the wave phase velocity. Now, since fe is a Gaussian with 
its center shifted to be at wo, show that if uo > w/k the portion of fe immediately 
to the left of uo will have positive slope and so lead to instability. These qualitative 
ideas can easily be made quantitative, by considering a 1-D equilibrium where the ion 
distribution is tid Je 
fio = ET ae 
and the drifting electron distribution is 


no 


2 2 
e7 (0-0)? /v ee 


feo = 


The ion susceptibility will be the same as before, but to determine the electron sus- 
ceptibility we must reconsider the linearized Vlasov equation 
| 


Ofer pose de Oo, o o e70- uo)?/vFe = 0. 
ot x Me Ox Ov | T/2uUP7. 


T/2ure 


This equation can be simplified by defining v’ = v — uo, Show that the electron 
susceptibility becomes 


1 
e= aw |l +aZ(a 
X © ys i [ (a)] 
where now a = (w — kuo)/kure. Suppose Te >> T; so that the electron Landau 
damping term dominates. Show that if uo > wr /k the electron imaginary term will 
reverse sign and give instability. 


. Suppose a current J flows in a long cylindrical plasma of radius a, density n, ion mass 


mi; for which Te >> T;. Write a criterion for ion acoustic instability in terms of an 
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appropriate subset of these parameters. Suppose a cylindrical mercury plasma with 
Te = 1—2eV, T; = 0.1 eV, diameter 2.5 cm, carries a current of J = 0.35 amps. 
At what density would an ion acoustic instability be expected to develop. Does this 
configuration remind you of an everyday object? Hint: there are several hanging from 
the ceiling of virtually every classroom. 


Cold plasma waves in a magnetized plasma 


Chapter 4 showed that finite temperature is responsible for the lowest order dispersive 
terms in both electron plasma waves [dispersion w? = we + 3k?KT, /me] and ion acoustic 
waves [dispersion w? = k?c2/(1 + k?A7,.)]. Furthermore, finite temperature was shown 
in Chapter 5 to be essential to Landau damping and instability. 

Chapter 4 also contained a derivation of the electromagnetic plasma wave [dispersion 
wW? = w?, +k?c?] and of the inertial Alfvén wave [dispersion w? = k2u%/(1+k2c? /w?.)], 
both of which had no dependence on temperature. To distinguish waves which depend 
on temperature from waves which do not, the terminology “cold plasma wave” and “hot 
plasma wave” is used. A cold plasma wave is a wave having a temperature-independent 
dispersion relation so that the temperature could be set to zero without changing the wave, 
whereas a hot plasma wave has a temperature-dependent dispersion relation. Thus hot 
and cold do not refer to a ‘temperature’ of the wave, but rather to the wave’s dependence 
or lack thereof on plasma temperature. Generally speaking, cold plasma waves are just 
the consequence of a large number of particles having identical Hamiltonian-Lagrangian 
dynamics whereas hot plasma waves involve different groups of particles having different 
dynamics because they have different initial velocities. Thus hot plasma waves involve 
statistical mechanical or thermodynamic considerations. The general theory of cold plasma 
waves in a uniformly magnetized plasma is presented in this chapter and hot plasma waves 
will be discussed in later chapters. 


6.1 Redundancy of Poisson’s equation in electromagnetic 
mode analysis 


When electrostatic waves were examined in Chapter 4 it was seen that the plasma response 
to the wave electric field could be expressed as a sum of susceptibilities where the sus- 
ceptibility of each species was proportional to the density perturbation of that species. 
Combining the susceptibilities with Poisson’s equation gave a dispersion relation. How- 
ever, because electric fields can also be generated inductively, electrostatic waves are not 
the only type of wave. Inductive electric fields result from time-dependent currents, i.e., 
from charged particle acceleration, and do not involve density perturbations. As an exam- 
ple, the electromagnetic plasma wave involved inductive rather than electrostatic electric 
fields. The inertial Alfvén wave involved inductive electric fields in the parallel direction 
and electrostatic electric fields in the perpendicular direction. 
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One might expect that a procedure analogous to the previous derivation of electrostatic 
susceptibilities could be used to derive inductive “susceptibilities” which would then be 
used to construct dispersion relations for inductive modes. It turns out that such a procedure 
not only gives dispersion relations for inductive modes, but also includes the electrostatic 
modes. Thus, it turns out to be unnecessary to analyze electrostatic modes separately. The 
main reason for investigating electrostatic modes separately as done earlier is pedagogical 
— it is easier to understand a simpler system. To see why electrostatic modes are auto- 
matically included in an electromagnetic analysis, consider the interrelationship between 
Poisson’s equation, Ampere’s law, and a charge-weighted summation of the two-fluid con- 
tinuity equation, 


1 
V-E=-— odo; l 
= 2, noq (6.1) 
OE 
V x B = uoJ + Eoo r’ (6.2) 
Ong O 
Zo FE +V- (nev¥e)] = af 2a Ne Me +V-J=0. (6.3) 
The divergence of Eq.(6.2) gives 
fê) 
E vie 0 (6.4) 
and substituting Eq.(6.3) gives 
f8) 
F -Zra +&V-E| =0 (6.5) 


which is just the time derivative of Poisson’s equation. Integrating Eq.(6.5) shows that 


— >D Node + €0V - E =const. (6.6) 


Poisson’s equation, Eq.(6.1), thus provides an initial condition which fixes the value of 
the constant in Eq.(6.6). Since all small-amplitude perturbations are assumed to have the 
phase dependence exp(ik - x — iwt) and therefore behave as a single Fourier mode, the 
0/Ot operator in Eq. (6.5) is replaced by —iw in which case the constant in Eq.(6.6) is 
automatically set to zero, making a separate consideration of Poisson’s equation redundant. 
In summary, the Fourier-transformed Ampere’s law effectively embeds Poisson’s equation 
and so a discussion of waves based solely on currents describes inductive, electrostatic 
modes and also contains modes involving a mixture of inductive and electrostatic electric 
fields such as the inertial Alfvén wave. 


6.2 Dielectric tensor 
Section 3.8 showed that a single particle immersed in a constant, uniform equilibrium mag- 
netic field B =Bo2Z and subject to a small-amplitude wave with electric field ~ exp(ik - x— 


iwt) has the velocity 


E, iwo 2XE 


ik- x—iwt 
- e 3 6.7 
WMo 1 — w/w? w 1- w/w? (6.7) 
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The tilde ~ denotes a small-amplitude oscillatory quantity with space-time dependence 
exp(ik - x —iwt); this phase factor may or may not be explicitly written, but should always 
be understood to exist for a tilde-denoted quantity. 

The three terms in Eq.(6.7) are respectively: 

1. The parallel quiver velocity- this quiver velocity is the same as the quiver velocity of 
an unmagnetized particle, but is restricted to parallel motion. Because the magnetic 
force q(v x B) vanishes for motion along the magnetic field, motion parallel to B in 
a magnetized plasma is identical to motion in an unmagnetized plasma . 


2. The generalized polarization drift- this motion has a resonance at the cyclotron fre- 
quency but at low frequencies such that w << Weg, it reduces to the polarization drift 
Vio = MoE, / qo B? derived in Chapter 3. 


3. The generalized E x B drift- this also has a resonance at the cyclotron frequency and 
for w << Weg reduces to the drift vg = E x B/B 2 derived in Chapter 3. 
The particle velocities given by Eq.(6.7) produce a plasma current density 


J = yes N00 Io Vo 


E, iwo EXE (0:8) 


w2 
ONE po mosi ik-x—iwt 
= ieo, r E24 1 u2 Ju? e . 
co 


w 1—w2,/w? 


If these plasma currents are written out explicitly, then Ampere’s law has the form 


1WEQ é) 


(6.9) 


= JE 
MoI +Ho%0 y 


< 
x 
w 
| 


; ae Ëy EL We ZXE 
= Lo ico) zZ q 2 2 2 2 
a 1 — w2, /w w 1— w2 /w 


where a factor exp (ik - x — iwt) is implicit. 

The cold plasma wave equation is established by combining Ampere’s and Faraday’s 
law in a manner similar to the method used for vacuum electromagnetic waves. However, 
before doing so, it is useful to define the dielectric tensor K. This tensor contains the 
information in the right hand side of Eq.(6.9) so that this equation is written as 


ð 
V x B= tipo (R-E) (6.10) 
where 
2 = ; E 
> ee) ee E, Wwe ZXE 
K-E = E Z ee a TT 
E | í 1 — w2, /w? w 1- w/w? 
S —iD 0 

= |ip S 0|- (6.11) 
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and the elements of the dielectric tensor are 
w2 W w2 
S=1- — t D = z P=1— Z, (6.12 
2 w? — Wo pas w z- E 2 w? l ; 

The nomenclature S, D, P for the matrix elements was introduced by Stix (1962) and is 
a mnemonic for “Sum”, “Difference”, and “Parallel”. The reasoning behind “Sum” and 
“Difference” will become apparent later, but for now it is clear that the P element cor- 
responds to the cold-plasma limit of the parallel dielectric, i.e., P = 1 + X; + Xe where 
KS —w? /w. This is just the cold limit of the unmagnetized dielectric because behavior 
involving parallel motions in a magnetized plasma is identical to that in an unmagnetized 
plasma. In the limit of no plasma, K becomes the unit tensor and describes the effect of 
the vacuum displacement current only. 

This definition of the dielectric tensor means that Maxwell’s equations, the Lorentz 
equation, and the plasma currents can now be summarized in just two coupled equations, 
namely 


10 se 
VxB = sa (KE) (6.13) 
ðB 
CE. S a 14 
x At (6.14) 


The cold plasma wave equation is obtained by taking the curl of Eq.(6.14) and then substi- 
tuting for V x B using (6.13) to obtain 


1 @ 
Vx (VxB)=-355 (KE) (6.15) 


Since a phase dependence exp(ik - x — iwt) is assumed, this can be written in algebraic 
form as 


2 
k x (k x E) =-4 KE. (6.16) 
c 
It is now convenient to define the refractive index n = ck/w, a renormalization of the 


wavevector k arranged so that light waves have a refractive index of unity. Using this 
definition Eq.(6.16) becomes 


nn-E—-n?E+K -E=0, (6.17) 


which is essentially a set of three homogeneous equations in the three components of E. 
The refractive index n = ck/w can be decomposed into parallel and perpendicular 
components relative to the equilibrium magnetic field B = Bo 2Z. For convenience, the x axis 
of the coordinate system is defined to lie along the perpendicular component of n so that 
Ny = 0 by assumption. This simplification is possible for a spatially uniform equilibrium 
only; if the plasma is non-uniform in the x —y plane, there can be a real distinction between 
x and y direction propagation and the refractive index in the y-direction cannot be simply 
defined away by choice of coordinate system. 
To set the stage for obtaining a dispersion relation, Eq.(6.17) is written in matrix form 
as 
Sine AD NN Ex 
iD S-n? 0 -| EB, | =0 (6.18) 
NN (0) P— n2 E, 
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where, for clarity, the tildes have been dropped. It is now useful to introduce a spherical 
coordinate system in k-space (or equivalently refractive index space) with Z defining the 
axis and 0 the polar angle. Thus, the Cartesian components of the refractive index are 
related to the spherical components by 


Ne = nsind 
nN, = ncosé 
n? = nin? (6.19) 
and so Eq.(6.18) becomes 
S — n? cos? 0 —iD n? sin 0 cos 0 Ey 
iD S-n? 0 -| Ey | =0. (6.20) 
n? sinf cosð 0 P — n?sin? 0 Ey 


6.2.1 Mode behavior at 6 = 0 


Non-trivial solutions to the set of three coupled equations for Ez, Ey Æ; prescribed by 
Eq.(6.20) exist only if the determinant of the matrix vanishes. For arbitrary values of 0, 
this determinant is complicated. Rather than examining the arbitrary-@ determinant im- 
mediately, two simpler limiting cases will first be considered, namely the situations where 
0 = O(ie., k || Bo) and 6 = 7/2 (i.e., k L Bo). These special cases are simpler than the 
general case because the off-diagonal matrix elements n? sin 0 cos 0 vanish for both 0 = 0 
and 0 = 7/2. 
When 0 = 0 Eq.(6.20) becomes 


S-n? -iD 0 Ez 
iD S-n? 0 |-| B, | =0 . (6.21) 
0 0 P E 


The determinant of this system is 


[s sw = D> P20 (6.22) 
which has roots 
P=0 (6.23) 
and 
n?— S= +D. (6.24) 
Equation (6.24) may be rearranged in the form 
n? =R, =L (6.25) 
where 
R=S+D, L=S-—D (6.26) 


have the mnemonics “right” and “left”. The rationale behind the nomenclature “S(um)” 
and “Dcfference)” now becomes apparent since 


eet. ye ee (6.27) 


Ss 
Da? 2 
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What does all this algebra mean? Equation (6.25) states that for 0 = O the dispersion re- 
lation has two distinct roots, each corresponding to a natural mode (or characteristic wave) 
constituting a self-consistent solution to the Maxwell-Lorentz system. The definitions in 
Eqs.(6.12) and (6.26) show that 


w2 


ig? 
R=1 p7 L=1 i :2 
2 wlw + Wee)’ Ds w(W — Wer) ee) 


o 


so that R diverges when w = —weo Whereas L diverges when w = weg. Since Weg = 
qo B/Mo, the ion cyclotron frequency is positive and the electron cyclotron frequency is 
negative. Hence, R, diverges at the electron cyclotron frequency, whereas L diverges at the 
ion cyclotron frequency. When w — co, both R, L — 1. In the limit w — 0, evaluation of 
R, L must be done very carefully, since 


wee = No Ge Mo 
Weg EQMe qo B 
No qo 
e 6.29 
AB (6.29) 
so that 
2 2 
Wwe. w 
pea (6.30) 
Wei Wee 
Thus 
1 we. w2 
lm R,L = 1 pr } a 
w—0 w | (wwa) (w wee) 
= 1— Wri + Wee 
WeiWce 
1, Mee Mi Me 
E EOMe qB qeB 
2 
= Ta 
wr, 
2 
C 
= lee (6.31) 
VA 


where vł = B?/tigp is the Alfvén velocity. Thus, at low frequency, both R and L are 
related to Alfvén modes. The n? = L mode is the slow mode (larger k) and the n? = R 
mode is the fast mode (smaller k). Figure 6.1 shows the frequency dependence of the 
n? = R, L modes. 
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Figure 6.1: Propagation parallel to the magnetic field. 


Having determined the eigenvalues for 6 = 0, the associated eigenvectors can now 
be found. These are obtained by substituting the eigenvalue back into the original set of 
equations; for example, substitution of n? = R into Eq.(6.21) gives 


-D -iD 0 Ey 
iD -D, 0 |-| Ey | =0, (6.32) 
0 0 P E; 
so that the eigenvector associated with n? = R is 
Ey * A 2 
— = —i, for eigenvalue n^ = R. (6.33) 
Ey 


The implication of this eigenvector can be seen by considering the root n = +R so that 
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the electric field in the plane orthogonal to z has the form 


E, = Re{E, (+ ig) exp(ik,z — iwt)} (6.34) 
= |E |{ĉcos(kzz —wt + ô) —ysin(k,z — wt + 6)} ` 
where E; = |E; |e®. This is a right-hand circularly polarized wave propagating in the 


positive z direction; hence the nomenclature R. Similarly, the n? = L root gives a left-hand 
circularly polarized wave. Linearly polarized waves may be constructed from appropriate 
sums and differences of these left- and right-hand circularly polarized waves. 

In summary, two distinct modes exist when the wavevector happens to be exactly par- 
allel to the magnetic field (0 = 0): a right-hand circularly polarized wave with dispersion 
n? = R with n — oo at the electron cyclotron resonance and a left-hand circularly polar- 
ized mode with dispersion n? = L with n — co at the ion cyclotron resonance. Since ion 
cyclotron motion is left-handed (mnemonic ‘Lion’) it is reasonable that a left-hand circu- 
larly polarized wave resonates with ions, and vice versa for electrons. At low frequencies, 
these modes become Alfvén modes with dispersion n2? = 1 + c?/v3 for 0 = 0. In the 
Chapter 4 discussion of Alfvén modes the dispersions of both compressional and shear 
modes were found to reduce to c?k?/w? = n? = c?/v% for 9 = 0. One one may ask why 
a ‘1’ term did not appear in the Chapter 4 dispersion relations? The answer is that the ‘1’ 
term comes from displacement current, a quantity neglected in the Chapter 4 derivations. 
The displacement current term shows that if the plasma density is so low (or the magnetic 
field is so high) that v4 becomes larger than c, then Alfvén modes become ordinary vac- 
uum electromagnetic waves propagating at nearly the speed of light. In order for a plasma 
to demonstrate significant Alfvenic (i.e., MHD behavior) it must satisfy B/\/[i9p << c or 
equivalently have wei << Wpi- 


6.2.2 Cutoffs and resonances 


The general situation where n? — oo is called a resonance and corresponds to the wave- 


length going to zero. Any slight dissipative effect in this situation will cause large wave 
damping. This is because if wavelength becomes infinitesimal and the fractional attenua- 
tion per wavelength is constant, there will be a near-infinite number of wavelengths and the 
wave amplitude is reduced by the same fraction for each of these. Figure 6.1 also shows 
that it is possible to have a situation where n? = 0. The general situation where n? = 0 is 
called a cutoff and corresponds to wave reflection, since n changes from being pure real to 
pure imaginary. If the plasma is non-uniform, it is possible for layers to exist in the plasma 
where either n? — 00 or n? = 0; these are called resonance or cutoff layers. Typically, if 
a wave intercepts a resonance layer, it is absorbed whereas if it intercepts a cutoff layer it 
is reflected. 


6.2.3 Mode behavior at 0 = 7/2 


When @ = 7/2 Eq.(6.20) becomes 


S —iD 0 Ez 
iD S-n? 0 -| E | =0 (6.35) 
0 o0 P-n? E 


N 
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and again two distinct modes appear. The first mode has as its eigenvector the condition 
that Æ, 4 0. The associated eigenvalue equation is P — n? = 0 or 


W= REY w (6.36) 


which is just the dispersion for an electromagnetic plasma wave in an unmagnetized plasma. 
This is in accordance with the prediction that modes involving particle motion strictly par- 
allel to the magnetic field are unaffected by the magnetic field. This mode is called the 
ordinary mode because it is unaffected by the magnetic field. 

The second mode involves both E, and E, and has the eigenvalue equation S(S — 
n?) — D? = 0 which gives the dispersion relation 

bi, a le RL 
n= ra Snai (6.37) 

Cutoffs occur here when either R = 0 or L = 0 and a resonance occurs when S = 0. Since 
this mode depends on the magnetic field, it is called the extraordinary mode.The S = 0 
resonance is called a hybrid resonance because it depends on a hybrid of w2, and we, 
terms (note that w2, terms depend on single-particle physics whereas we, terms depend 
on collective motion physics). Because S is quadratic in w?, the equation S = 0 has two 
distinct roots and these are found by explicitly writing 


ise u2 
= Pe pe = 
Seto ga =O (6.38) 


ci 


A plot of this expression shows that the two roots are well separated. The large root may 
be found by assuming that w ~ O(wece) in which case the ion term becomes insignificant. 
Dropping the ion term shows that the large root of S is simply 


wih = ee + we (6.39) 


which is called the upper hybrid frequency. The small root may be found by assuming that 
w? << w2, which gives the lower hybrid frequency 


oanu pa, (6.40) 


6.2.4 Very low frequency modes where @ is arbitrary 


Equation (6.31) shows that for w << we; 


S x RxLx1+Êm} 


N 


D-~ 0 (6.41) 
so the cold plasma dispersion simplifies to 
S-n? 0 NeNz Ey 
0 S-n? 0 Ey | =0. (6.42) 
NgeNz 0 P- n2 E 
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Because D = 0 the determinant factors into two modes, one where 


(S—n?) By =0 (6.43) 
and the other where à l 
Nİ NgNz Eg I 
NgNz P-n? | f | E, | a (6:44) 


The former gives the dispersion relation 


n? =S (6.45) 


with Æ, Æ 0 as the eigenvector. This mode is the fast or compressional mode, since in the 
limit where the displacement current can be neglected, Eq.(6.45) becomes w? = k?v?} . The 
latter mode involves finite Æ, and Æ, and has the dispersion 


rP 
TATS, 
which is the inertial Alfvén wave w? = k?v3/ (1+ k2c?/w?.) in the limit that the dis- 
placement current can be neglected. 


(S-n?) (6.46) 


n 


6.2.5 Modes where w and 0 are arbitrary 


The respective behaviors at 0 = O and at 6 = 7/2 and the low frequency Alfvén modes 
gave a useful introduction to the cold plasma modes and, in particular, showed how modes 
can be subject to cutoffs or resonances. We now evaluate the determinant of the matrix in 
Eq.(6.20) for arbitrary 0 and arbitrary w; after some algebra this determinant can be written 
as 


An* — Bn? +C =0 (6.47) 
where 
A = Ssin?0+ Pcos?6 
B = (S?— D?)sin?0 + PS(1 +cos76) (6.48) 
C = P(S? -— DS PRL. 


Equation (6.47) is quadratic in n? and has the two roots 


2 Bt+VB?—4AC 
p 2A l 
Thus, the two distinct modes in the special cases of (i) 9 = 0, 7/2 or (ii) w << wei; were 
just particular examples of the more general property that a cold plasma supports two dis- 
tinct types of modes. Using a modest amount of algebraic manipulation (cf. assignments) it 
is straightforward to show that for real 0 the quantity B? — 4 AC is positive definite, since 


n (6.49) 


B? — 4AC = (S? — D? — SP)” sin +0 + 4P? D? cos 0. (6.50) 


Thus n is either pure real (corresponding to a propagating wave) or pure imaginary (corre- 
sponding to an evanescent wave). 

From Eqs.(6.47) and (6.48) it is seen that cutoffs occur when C = 0 which happens if 
P = 0, L = 0, or R = 0. Also, resonances correspond to having A — 0 in which case 


S sin? + Pcos76 ~ 0. (6.51) 
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6.2.6 Wave normal surfaces 


The information contained in a dispersion relation can be summarized in a qualitative, 
visual manner by a wave normal surface which is a polar plot of the phase velocity of the 
wave normalized to c. Since n = ck/w, a wave normal surface is just a plot of 1/n(0) v. 
0. The most basic wave normal surface is obtained by considering the equation for a light 
wave in vacuum, 


3? ; 
($ = ev?) E= 0, (6.52) 


which has the simple dispersion relation 


1 w? 
2 gpg” 1. (6.53) 
Thus the wave normal surface of a light wave in vacuum is just a sphere of radius unity 
because w/k = 1/nc is independent of direction. Wave normal surfaces of plasma waves 
are typically more complicated because n usually depends on 0. The radius of the wave 
normal surface goes to zero at a resonance and goes to infinity at a cutoff (since 1/n — 0 
at a resonance, | /n — oo at a cutoff). 


6.2.7 Taxonomy of modes — the CMA diagram 


Equation (6.49) gives the general dispersion relation for arbitrary 0. While formally correct, 
this expression is of little practical value because of the complicated chain of dependence of 
n? on several variables. The CMA diagram (Clemmow and Mullaly (1955), Allis (1955)) 
provides an elegant method for revealing and classifying the large number of qualitatively 
different modes embedded in Eq.(6.49). 

In principle, Eq. (6.49) gives the dependence of n? on the six parameters 0, w, Wye, 
Wpi, Wee, and wei. However, wp; and wpe are not really independent parameters and neither 
are wei and Wee because w2, /w?, = (m;/me)” and w/w, = M; /Me for singly charged 
ions. Thus, once the ion species has been specified, the only free parameters are the density 
and the magnetic field. Once these have been specified, the plasma frequencies and the 
cyclotron frequencies are determined. It is reasonable to normalize these frequencies to 
the wave frequency in question since the quantities S, P, D depend only on the normalized 
frequencies. Thus, n? is effectively just a function of 0, we. /w? and w2, /w?. Pushing this 
simplification even further, we can say that for fixed we, /w? and w2, /w?, the refractive 
index n is just a function of 6. Then, once n = n(6) is known, it can be used to plot a 
wave normal surface, i.e., w /kc plotted v. 0. 

The CMA diagram is developed by first constructing a chart where the horizontal axis is 
In (w7 /W? + w?,/w?) and the vertical axis is In (w2, /w*) . For a given w any point on this 
chart corresponds to a unique density and a unique magnetic field. If we were ambitious, 
we could plot the wave normal surfaces 1/n v. 6 for a very large number of points on this 
chart, and so have plots of dispersions for a large set of cold plasmas. While conceivable, 
such a thorough examination of all possible combinations of density and magnetic field 
would require plotting an inconveniently large number of wave normal surfaces. 
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Figure 6.2: CMA diagram. 
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It is actually unnecessary to plot this very large set of wave normal surfaces because 
it turns out that the qualitative shape (i.e., topology) of the wave normal surfaces changes 
only at specific boundaries in parameter space and away from the these boundaries the 
wave normal surface deforms, but does not change its topology. Thus, the parameter space 
boundaries enclose regions of parameter space where the qualitative shape (topology) of 
wave normal surfaces does not change. The CMA diagram, shown in Fig.6.2 charts these 
parameter space boundaries and so provides a powerful method for classifying cold plasma 
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modes. Parameter space is divided up into a finite number of regions, called bounded vol- 
umes, separated by curves in parameter space, called bounding surfaces, across which the 
modes change qualitatively. Thus, within a bounded volume, modes change quantitatively 
but not qualitatively. For example, if Alfvén waves exist at one point in a particular bounded 
volume, they must exist everywhere in that bounded volume, although the dispersion may 
not be quantitatively the same at different locations in the volume. 
The appropriate choice of bounding surfaces consists of: 
1. The principle resonances which are the curves in parameter space where n? has a 
resonance at either 0 = O or 0 = 7/2. Thus, the principle resonances are the curves 
R = œ (ie., electron cyclotron resonance), L = oo (i.e., ion cyclotron resonance), 
and S = 0 (i.e., the upper and lower hybrid resonances). 


2. The cutoffs R = 0, L = 0, and P = 0. 
The behavior of wave normal surfaces inside a bounded volume and when crossing a 
bounded surface can be summarized in five theorems (Stix 1962), each a simple conse- 
quence of the results derived so far: 


1. Inside a bounded volume n cannot vanish. Proof: n vanishes only when PRL = 0, 
but P = 0, R = 0 and L = 0 have been defined to be bounding surfaces. 


2. If n? has a resonance (i.e., goes to infinity) at any point in a bounded volume, then 
for every other point in the same bounded volume, there exists a resonance at some 
unique angle @,-, and its associated mirror angles, namely —0,<;, 7 — Ôres, and 
— (T — Ores) but at no other angles. Proof: If n? — oo then A — 0 in which case 
tan?0,c; = —P/S determines the unique fpes. Now tan?(7 — Ores) = tan Ores 
so there is also a resonance at the supplement 0 = a — pes- Also, since the square 
of the tangent is involved, both es and 7 — Ores may be replaced by their negatives. 
Neither P nor S can change sign inside a bounded volume and both are single valued 
functions of their location in parameter space. Thus, — P/S can only change sign at 
a bounding surface. In summary, if a resonance occurs at any point in a bounding 
surface, then a resonance exists at some unique angle pes and its associated mirror 
angles at every point in the bounding surface. Resonances only occur when P and 
S have opposite signs. Since 1/n goes to zero at a resonance, the radius of a wave 
normal surface goes to zero at a resonance. 


3. At any point in parameter space, for a given interval in 0 in which n is finite, n is 
either pure real or pure imaginary throughout that interval. Proof: n? is always real 
and is a continuous function of 0. The only situation where n can change from being 
pure real to being pure imaginary is when n? changes sign. This occurs when n? 
passes through zero, but because of the definition for bounding surfaces, n? does not 
vanish inside a bounded volume. Although n? may change sign when going through 
infinity, this situation is not relevant because the theorem was restricted to finite n. 


4. n is symmetric about 0 = 0 and 0 = 7/2. Proof: n is a function of sin?6 and of 
cos 70, both of which are symmetric about 0 = 0 and 0 = 7/2. 


5. Except for the special case where the surfaces PD = 0 and RL = PS intersect, the 
two modes may coincide only at 0 = 0 or at 0 = 7/2. Proof: For 0 < 0 < 7/2 the 
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square root in Eq.(6.49) is 


V B2—4AC = V(aL — SP) sin 40 + 4P2 D2 cos20 (6.54) 
and can only vanish if PD = 0 and RL = PS simultaneously. 
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Figure 6.3: (a), (b), (c) show types of wave normal surfaces; (d) and (e) show permissible 
overlays of wave normal surfaces. 


These theorems provide sufficient information to characterize the morphology of wave 
normal surfaces throughout all of parameter space. In particular, the theorems show that 
only three types of wave normal surfaces exist. These are ellipsoid, dumbbell, and wheel 
as shown in Fig.6.3(a,b,c) and each is a three-dimensional surface symmetric about the z 
axis. 

We now discuss the features and interrelationships of these three types of wave normal 
surfaces. In this discussion, each of the two modes in Eq.(6.49) is considered separately; 
i.e., either the plus or the minus sign is chosen. The convention is used that a mode is 
considered to exist (i.e., has a wave normal surface) only if n? > 0 for at least some range 
of 0; if n? < 0 for all angles, then the mode is evanescent (i.e., non-propagating) for all 
angles and is not plotted. The three types of wave normal surfaces are: 
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1. Bounded volume with no resonance and n? > 0 at some point in the bounded volume. 
Since n? = 0 occurs only at the bounding surfaces and n? — oo only at resonances, 
n? must be positive and finite at every 0 for each location in the bounded volume. 
The wave normal surface is thus ellipsoidal with symmetry about both 0 = O and 
0 = 1/2. The ellipse may deform as one moves inside the bounded volume, but will 
always have the morphology of an ellipse. This type of wave normal surface is shown 
in Fig.6.3(a). The wave normal surface is three dimensional and is azimuthally sym- 
metric about the z axis. 


2. Bounded volume having a resonance at some angle 0,.., where 0 < pes < 1/2 and 
n?(0) positive for 0 < Ores. At Ores, n — © so the radius of the wave normal surface 
goes to zero. For 0 < Ores, the wave normal surface exists (n is pure real since 
n? > 0) and is plotted. At resonances n?(0) passes from —oo to +00 or vice versa. 
This type of wave normal surface is a dumbbell type as shown in Fig. 6.3(b). 


3. Bounded volume having a resonance at some angle pes where O < Ores < 7/2 and 
n?(0) positive for 0 > Ores. This is similar to case 2 above, except that now the wave 
normal surface exists only for angles greater than Opes resulting in the wheel type 
surface shown in Fig.6.3(c). 

Consider now the relationship between the two modes (plus and minus sign) given by 
Eq.(6.49). Because the two modes cannot intersect (cf. theorem 5) at angles other than 
0 = 0,7/2 and mirror angles, if one mode is an ellipsoid and the other has a resonance 
(i.e., is a dumbbell or wheel), the ellipsoid must be outside the other dumbbell or wheel; 
for if not, the two modes would intersect at an angle other than 0 = 0 or 0 = 77/2). This is 
shown in Figs. 6.3(d) and (e). 

Also, only one of the modes can have a resonance, so at most one mode in a bounded 
volume can be a dumbbell or wheel. This can be seen by noting that a resonance occurs 
when A — 0. In this case B? >> |4AC| in Eq.(6.49) and the two roots are well-separated. 
This means that the binomial expansion can be used on the square root in Eq.(6.49) to obtain 


ow Bt (B —2AC/B) 
2A 
B C 
n? x oe n? ~j (6.55) 


where |n} | >> |n? | since B? >> |4AC|. The root n? has the resonance and the root 


n? has no resonance. Since the wave normal surface of the minus root has no resonance, 


its wave normal surface must be ellipsoidal (if it exists). Because the ellipsoidal surface 
must always lie outside the wheel or dumbbell surface, the ellipsoidal surface will have a 
larger value of w/kc than the dumbbell or wheel at every 0 and so the ellipsoidal mode 
will always be the fast mode. The mode with the resonance will be a dumbbell or wheel, 
will lie inside the ellipsoidal surface, and so will always be the slow mode. This concept 
of well-separated roots is quite useful and, if the roots are well-separated, then Eq.(6.47) 
can be solved approximately for the large root (slow mode) by balancing the first two terms 
with each other, and for the small root (fast mode) by balancing the last two terms with 
each other. 


Parameter space is subdivided into thirteen bounded volumes, each potentially con- 
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taining two normal modes corresponding to two qualitatively distinct propagating waves. 
However, since two modes do not exist in all bounded volumes, the actual number of modes 
is smaller than twenty-six. As an example of a bounded volume with two waves, the wave 
normal surfaces of the fast and slow Alfvén waves are in the upper right hand corner of the 
CMA diagram since this bounded volume corresponds to w << wei and w << Wye, 1.€., 
above L = 0 and to the right of P = 0. 


6.2.8 Use of the CMA diagram 


The CMA diagram can be used in several ways. For example it can be used to (i) identify 
all allowed cold plasma modes in a given plasma for various values of w or (ii) investigate 
how a given mode evolves as it propagates through a spatially inhomogeneous plasma and 
possibly intersects resonances or cutoffs due to spatial variation of density or magnetic 
field. 

Let us consider the first example. Suppose the plasma is uniform and has a prescribed 
density and magnetic field. Since In (o2 + w?.) / w?] and In [w2 / w?] are the coordi- 
nates of the CMA diagram, varying w corresponds to tracing out a line having a slope 
of 45° and an offset determined by the prescribed density and magnetic field. High fre- 
quencies correspond to the lower left portion of this line and low frequencies to the upper 
right. Since the allowed modes lie along this line, if the line does not pass through a given 
bounded volume, then modes inside that bounded volume do not exist in the specified 
plasma. 

Now consider the second example. Suppose the plasma is spatially non-uniform in such 
a way that both density and magnetic field are a function of position. To be specific, suppose 
that density increases as one moves in the x direction while magnetic field increases as one 
moves in the y direction. Thus, the CMA diagram becomes a map of the actual plasma. A 
wave with prescribed frequency w is launched at some position x,y and then propagates 
along some trajectory in parameter space as determined by its local dispersion relation. 
The wave will continuously change its character as determined by the local wave normal 
surface. Thus a wave which is injected with a downward velocity as a fast Alfvén mode in 
the upper-right bounded volume will pass through the L = oo bounding surface and will 
undergo only a quantitative deformation. In contrast, a wave which is launched as a slow 
Alfvén mode (dumbbell shape) from the same position will disappear when it reaches the 
L = oo bounding surface, because the slow mode does not exist on the lower side (high 
frequency side) of the L = oo bounding surface. The slow Alfvén wave undergoes ion 
cyclotron resonance at the L = oo bounding surface and will be absorbed there. 


6.3 Dispersion relation expressed as a relation between 72 
and n? 


The CMA diagram is very useful for classifying waves, but is often not so useful in practical 
situations where it is not obvious how to specify the angle 0. In a practical situation a wave 
is typically excited by an antenna that lies in a plane and the geometry of the antenna 
imposes the component of the wavevector in the antenna plane. The transmitter frequency 
determines w. 
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For example, consider an antenna located in the x = 0 plane and having some specified 
z dependence. When Fourier analyzed in z, such an antenna would excite a characteristic 
k, spectrum. In the extreme situation of the antenna extending to infinity in the x = 0 plane 
and having the periodic dependence exp(ik,z), the antenna would excite just a single kz. 
Thus, the antenna-transmitter combination in this situation would impose k, and w but 
leave k, undetermined. The job of the dispersion relation would then be to determine k,,. It 
should be noted that antennas which are not both infinite and perfectly periodic will excite 
a spectrum of k, modes rather than just a single k, mode. 
By writing n2 = n? sin? 0 and n2 = n? cos? 0, Eqs. (6.47) and (6.48) can be expressed 
as a quadratic equation for n2, namely 
Sn —[$(S + P) — D?|nz + P[S? — D?) =0 (6.56) 


x 


where 

f= 5. (6.57) 
If the two roots of Eq.(6.56) are well-separated, the large root is found by balancing the 
first two terms to obtain 


S(S + P) — D? 
n? ~ a (large root) (6.58) 
or in the limit of large P (i.e., low frequencies), 
2 SP 
a ie S ? 
The small root is found by balancing the last two terms of Eq.(6.56) to obtain 


n (large root). (6.59) 


Pls? — D? 
2 ~ ae: P- A A (small root) (6.60) 


~ i be (small root). (6.61) 


Thus, any given n2 always has an associated large n? mode and an associated small n2 
mode. Because the phase velocity is inversely proportion to the refractive index, the root 
with large n? is called the slow mode and the root with small n2 is called the fast mode. 

Using the quadratic formula it is seen that the exact form of these two roots of Eq.(6.56) 
is given by 


B(S + P)— D? + \/[B(S— P) — D?]? +4PD?n? 
nè = : (6.62) 
= 2S 
It is clear that n? can become infinite only when S = 0. Situations where n2 is complex 
(i.e., neither pure real or pure imaginary) can occur when P is large and negative in which 
case the argument of the square root can become negative. In these cases, 0 also becomes 
complex and is no longer a physical angle. This shows that considering real angles between 
O < 0 < 2r does not account for all possible types of wave behavior. The regions where 
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n2 becomes complex is called a region of inaccessibility and is a region where Eq.(6.56) 
does not have real roots. If a plasma is non-uniform in the x direction so that S, P, and D 
are functions of x and w? < we + wri so that P is negative, the boundaries of a region of 
inaccessibility (if such a region exists) are the locations where the square root in Eq.(6.62) 
vanishes, i.e., where there is a solution for S(S — P) — D? =+,/—4PD?n2. 


6.4 A journey through parameter space 


Imagine an enormous plasma where the density increases in the x direction and the mag- 
netic field points in the z direction but increases in the y direction. Suppose further that a 
radio transmitter operating at a frequency w is connected to a hypothetical antenna which 
emits plane waves, i.e. waves with spatial dependence exp(ik - x). These assumptions are 
somewhat self-contradictory because, in order to excite plane waves, an antenna must be 
infinitely long in the direction normal to k and if the antenna is infinitely long it cannot be 
localized. To circumvent this objection, it is assumed that the plasma is so enormous that 
the antenna at any location is sufficiently large compared to the wavelength in question to 
emit waves that are nearly plane waves. 

The antenna is located at some point x,y in the plasma and the emitted plane waves 
are detected by a phase-sensitive receiver. The position x, y corresponds to a point in CMA 
space. The antenna is rotated through a sequence of angles 0 and as the antenna is rotated, 
an observer walks in front of the antenna staying exactly one wavelength A = 27/k from 
the face of the antenna. Since A is proportional to 1/n = w/kc at fixed frequency, the 
locus of the observer’s path will have the shape of a wave normal surface, i.e., a plot of 1/n 
versus 0. 

Because of the way the CMA diagram was constructed, the topology of one of the two 
cold plasma modes always changes when a bounding surface is traversed. Which mode is 
affected and how its topology changes on crossing a bounding surface can be determined 
by monitoring the polarities of the four quantities S, P, R, L within each bounded volume. 
P changes polarity only at the P = 0 bounding surface, but R and L change polarity when 
they go through zero and also when they go through infinity. Furthermore, S = (R+ L)/2 
changes sign not only when S = 0 but also at R = co and at L = oo. 

A straightforward way to establish how the polarities of S, P, R, L change as bounding 
surfaces are crossed is to start in the extreme lower left corner of parameter space, corre- 
sponding to w? >> wae w2. This is the limit of having no plasma and no magnetic field 
and so corresponds to unmagnetized vacuum. The cold plasma dispersion relation in this 
limit is simply n? = 1;i.e., vacuum electromagnetic waves such as ordinary light waves 
or radio waves. Here S = P = R = L = 1 because there are no plasma currents. Thus 
S, P, R, L are all positive in this bounded volume, denoted as Region 1 in Fig.6.2 (regions 
are labeled by boxed numbers). To keep track of the respective polarities, a small cross is 
sketched in each of the 13 bounded volumes. The signs of L and R. are noted on the left 
and right of the cross respectively, while the sign of S is shown at the top and the sign of 
P is shown at the bottom. 

In traversing from region 1 to region 2, R passes through zero and so reverses polar- 
ity but the polarities of L, S, P are unaffected. Going from region 2 to region 3, S passes 
through zero so the sign of S reverses. By continuing from region to region in this manner, 
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the plus or minus signs on the crosses in each bounded volume are established. It is impor- 
tant to remember that S changes sign at both S = 0 and the cyclotron resonances L = oo 
and R = oo, but at all other bounding surfaces, only one quantity reverses sign. 

Modes with resonances (i.e., dumbbells or wheels) only occur if S and P have opposite 
sign which occurs in regions 3, 7, 8, 10, and 13. The ordinary mode (i.e., 0 = 7/2, n? = P) 
exists only if P > 0 and so exists only in those regions to the left of the P = O bounding 
surface. Thus, to the right of the P = 0 bounding surface only extraordinary modes exist 
(i.e., only modes where n? = RL/S at 0 = 7/2). Extraordinary modes exist only if 
RL/S > 0 which cannot occur if an odd subset of the three quantities R, L and S is 
negative. For example, in region 5 all three quantities are negative so extraordinary modes 
do not exist in region 5. The parallel modes n? = R, L do not exist in region 5 because 
R and L are negative there. Thus, no modes exist in region 5, because if a mode were to 
exist there, it would need to have a limiting behavior of either ordinary or extraordinary at 
0 = T /2 and of either right or left circularly polarized at 0 = 0. 

When crossing a cutoff bounding surface (R = 0, L = 0, or P = 0), the outer (i.e., 
fast) mode has its wave normal surface become infinitely large, w? /k?c? =1/n? — oo. 
Thus, immediately to the left of the P = 0 bounding surface, the fast mode (outer mode) 
is always the ordinary mode, because by definition this mode has the dispersion n? = P at 
0 = T /2 and so has a cutoff at P = 0. As one approaches the P = 0 bounding surface from 
the left, all the outer modes are ordinary modes and all disappear on crossing the P = 0 
line so that to the right of the P = 0 line there are no ordinary modes. 

In region 13 where the modes are Alfvén waves, the slow mode is the n? = L mode 
since this is the mode which has the resonance at L = oo. The slow Alfvén mode is the 
inertial Alfvén mode while the fast Alfvén mode is the compressional Alfvén mode. Going 
downwards from region 13 to region 11, the slow Alfvén wave undergoes ion cyclotron 
resonance and disappears, but the fast Alfvén wave remains. Similar arguments can be 
made to explain other boundary crossings in parameter space. 

A subtle aspect of this taxonomy is the division of region 6 into two sub-regions 6a, and 
6b. This subtlety arises because the dispersion at 0 = 7/2 has the form 


2 RL + PS +|RL-— PS| 
H S 
25 

In region 6, both S and P are positive. If RL— PS is also positive, then the plus sign gives 
the extraordinary mode which is the slow mode (bigger n, inner of the two wave normal 
surfaces). On the other hand, if RL — PS is negative, then the absolute value operator 
inverts the sign of RL — PS and the minus sign now gives the extraordinary mode which 
will be the fast mode (smaller n, outer of the two wave normal surfaces). Region 8 can 
also be divided into two regions (omitted here for clarity) separated by the RL = PS line. 
In region 8 the ordinary mode does not exist, but the extraordinary mode will be given by 
either the plus or minus sign in Eq.(6.63) depending on which side of the RL = PS line 
one is considering. 

For a given plasma density and magnetic field, varying the frequency corresponds to 
moving along a ‘mode’ line which has a 45 degree slope on the log-log CMA diagram. If 
the plasma density is increased, the mode line moves to the right whereas if the magnetic 
field is increased, the mode line moves up. Since any single mode line cannot pass through 
all 13 regions of parameter space only a limited subset of the 13 regions of parameter space 


= P,RL/S. (6.63) 
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can be accessed for any given plasma density and magnetic field. Plasmas with ee > w% 
are often labelled ‘overdense’ and plasmas with Wre < w, are correspondingly labeled 
‘underdense’.For overdense plasmas, the mode line passes to the right of the intersection 
of the P = 0, R = oo bounding surfaces while for underdense plasmas the mode line 
passes to the left of this intersection. Two different plasmas will be self-similar if they have 
similar mode lines. For example if a lab plasma has the same mode line as a space plasma 
it will support the same kind of modes, but do so in a scaled fashion. Because the CMA 
diagram is log-log the bounding surface curves extend infinitely to the left and right of the 
figure and also infinitely above and below it; however no new regions exist outside of what 
is sketched in Fig.6.2. 

The weakly-magnetized case corresponds to the lower parts of regions 1-5, while the 
low-density case corresponds to the left parts of regions 1, 2, 3, 6, 9, 10, and 12. The 
CMA diagram provides a visual way for categorizing a great deal of useful information. In 
particular, it allows identification of isomorphisms between modes in different regions of 
parameter space so that understanding developed about the behavior for one kind of mode 
can be readily adapted to explain the behavior of a different, but isomorphic mode located 
in another region of parameter space. 


6.5 High frequency waves: Altar-Appleton-Hartree 
dispersion relation 


Examination of the dielectric tensor elements S, P,and D shows that while both ion and 
electron terms are of importance for low frequency waves, for high frequency waves (w >> 
Wei, Wpi) the ion terms are unimportant and may be dropped. Thus, for high frequency 
waves the dielectric tensor elements simplify to 


Sg eee 
w? p we 
2 
W 
Pal— “re (6.64) 
ie 
ioe) 
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and the corresponding R and L terms are 


w2 


RSI- gy ua) 
= “2 Wee (6.65) 
L=1= pe 


w (W — Wee) 
The development of long distance short-wave radio communication in the 1930’s motivated 
investigations into how radio waves bounce from the ionosphere. Because the bouncing 
involves a P = 0 cutoff and because the ionosphere has we, of order w2, but usually larger, 
the relevant frequencies must be of the order of the electron plasma frequency and so are 
much higher than both the ion cyclotron and ion plasma frequencies. Thus, ion effects are 
unimportant and so all ion terms may be dropped in order to simplify the analysis. 
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Perhaps the most important result of this era was a peculiar, but useful, reformulation 
by Appleton, Hartree, and Altar? (Appleton 1932) of the w >> Wi, Wo; limit of Eq.(6.49). 
The intent of this reformulation was to express n? in terms of its deviation from the vacuum 
limit, n? = 1. An obvious way to do this is to define € = n? — 1 and then re-write Eq.(6.47) 
as an equation for €, namely 


A(@? +2€ +1) — B(E+1) + C=0 (6.66) 


or, after regrouping, 


A? + €(2A—-B)+A—-B+C=0. (6.67) 
Unfortunately, when this expression is solved for €, the leading term is —1 and so this 
attempt to find the deviation of n? from its vacuum limit fails. However, a slight rewriting 
of Eq.(6.67) as 
A-B+C  2A-B 
J £ 


and then solving for 1/£, gives 


+A=0 (6.68) 


pa 2(A— B+C) 

— B—2A+VB?—4AC 
This expression does not have a leading term of —1 and so allows the solution of Eq.(6.49) 
to be expressed as 


(6.69) 


2(A- B+C) 
B — 2A + VB? —4AC 
In the w >> wci, Wp; limit where S, P, D are given by Eq.(6.64), algebraic manipulation of 
Eq.(6.70) (cf. assignments) shows that there exists a common factor in the numerator and 
denominator of the second term. After cancelling this common factor, Eq.(6.70) reduces 


n? =14 (6.70) 


to 
2 2 
ope vs Whe 
Ú we 
n?=1 ; : (6.71) 
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2 
4 ned a 
r= |“ sinto 442e h: =) cos 20. (6.72) 
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Equation (6.71) is called the Altar-Appleton-Hartree dispersion relation (Appleton 1932) 
and has the desired property of showing the deviation of n? from the vacuum dispersion 
n? =1. 

We recall that the cold plasma dispersion relation simplified considerably when either 
0 = O or 0 = 7/2. A glance at Eq.(6.71) shows that this expression reduces indeed to 
n? = R,L for 0 = 0. Somewhat more involved manipulation shows that Eq.(6.71) also 
reduces to n? = P and n? = RL/S for 0 = 1/2. 


3See discussion by Swanson (1989) regarding the recent addition of Altar to this citation 
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Equation (6.71) can be Taylor-expanded in the vicinity of the principle angles 6 = 0 
and 0 = 7/2 to give dispersion relations for quasi-parallel or quasi-perpendicular propaga- 
tion. The terms quasi-longitudinal and quasi-transverse are commonly used to denote these 
situations. The nomenclature is somewhat unfortunate because of the possible confusion 
with the traditional convention that longitudinal and transverse refer to the orientation of 
k relative to the wave electric field. Here, longitudinal means kis nearly parallel to the sta- 
tic magnetic field while transverse means k is nearly perpendicular to the static magnetic 
field. 


6.5.1 Quasi-transverse modes (0 ~ 7/2) 


For quasi-transverse propagation, the first term in [ dominates, that is 


2 
4 2 2 
w ; w w ; 
= sin t0 >> 4- (: = =) cos 70. (6.73) 
z Ta ; 
In this case a binomial expansion of I gives 
w2 w? w2 : cos 70 i 
r = sn’ |1 +45 1-5] =, 
w w2 w sin 40 
2 
2 na 
w w 
~ -E sin? +2 (: =) cot 70. (6.74) 
w w 


Substitution of T into Eq.(6.71) shows that the generalization of the ordinary mode disper- 
sion to angles in the vicinity of 1/2 is 


2 =, (6.75) 


The subscript + here means that the positive sign has been used in Eq.(6.71). This mode 
is called the QTO mode as an acronym for ‘quasi-transverse-ordinary’. 

Choosing the — sign in Eq.(6.71) gives the quasi-transverse-extraordinary mode or 
QTX mode. After a modest amount of algebra (cf. assignments) the QTX dispersion is 
found to be 


7 z (6.76) 
E A 
Io z am SS sin 20 
w w 


Note that the QTX mode has a resonance near the upper hybrid frequency. 
6.5.2 Quasi-longitudinal dispersion (0 ~ 0) 


Here, the term containing cos 76 dominates in Eq.(6.72). Because there are no cancellations 
of the leading terms in I with any remaining terms in the denominator of Eq.(6.71), it 
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suffices to keep only the leading term of I’. Thus, in this limit 


2 2 
w w w w 
eanan RET = cos ĝ| = —2 | 1 a | £ cosh 
wW w w w 


since P = 1 — we /w? is assumed to be negative. Upon substitution for T in Eq.(6.71) and 
then simplifying one obtains 


r2 (6.77) 


2 wpe? 
n=l ni , QLR mode (6.78) 
r= | cos 0 
Ww 
and 
2 Weel w? 
n2? =1 r , QLL mode. (6.79) 
1+ | £ cosh 


These simplified dispersions are based on the implicit assumption that | P| is large, because 
if P — 0 the presumption that Eq.(6.77) gives the leading term in I would be inappropriate. 
When w <|wee cos 0| the QLR mode (quasi-longitudinal, right-hand circularly polarized) 
is called the whistler or helicon wave. This wave is distinguished by having a descending 
whistling tone which shows up at audio frequencies on sensitive amplifiers connected to 
long wire antennas. Whistlers may have been heard as early as the late 19th century by tele- 
phone linesmen installing long telephone lines. They become a subject of some interest in 
the trenches of the First World War when German scientist H. Barkhausen heard whistlers 
on a sensitive audio receiver while trying to eavesdrop on British military communications; 
the origin of these waves was a mystery at that time. After the war Barkhausen (1930) and 
Eckersley (1935) proposed that the descending tone was due to a dispersive propagation 
such that lower frequencies traveled more slowly, but did not explain the source location 
or propagation trajectory. The explanation had to wait over two more decades until Storey 
(1953) finally solved the mystery by showing that whistlers were caused by lightning bolts 
and identified two main types of propagation. The first type, called a short whistler re- 
sulted from a lightning bolt in the opposite hemisphere exciting a wave which propagated 
dispersively along the Earth’s magnetic field to the observer. The second type, called a 
long whistler, resulted from a lightning bolt in the vicinity of the observer exciting a wave 
which propagated dispersively along field lines to the opposite hemisphere, then reflected, 
and traveled back along the same path to the observer. The dispersion would be greater in 
this round trip situation and also there would be a correlation with a click from the local 
lightning bolt. Whistlers are routinely observed by spacecraft flying through the Earth’s 
magnetosphere and the magnetospheres of other planets. 

The reason for the whistler’s descending tone can be seen by representing each lightning 
bolt as a delta function in time 


6(t) : if e Edu, (6.80) 


20 
A lightning bolt therefore launches a very broad frequency spectrum. Because the ionospheric 
electron plasma frequency is in the range 10-30 MHz, audio frequencies are much lower 
than the electron plasma frequency, i.e., wpe >> w and so |P| >> 1. The electron cy- 
clotron frequency in the ionosphere is of the order of 1 MHz so wee >> w also. Thus, the 
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whistler dispersion for acoustic (a few kHz) waves in the ionosphere is 


2 
aa (6.81) 


= | Wee 
| cos 0 


eae (6.82) 
e V |wee cos 0| 


Each frequency w in Eq.(6.80) has a corresponding k given by Eq.(6.82) so that the distur- 
bance g(x, t) excited by a lightning bolt has the form 


or 


1 . A 
g(a,t) = = if ere et dgw (6.83) 


where x = 0 is the location of the lightning bolt. Because of the strong dependence of k 
on w, contributions to the phase integral in Eq.(6.83) at adjacent frequencies will in general 
have substantially different phases. The integral can then be considered as the sum of 
contributions having all possible phases. Since there will be approximately equal amounts 
of positive and negative contributions, the contributions will cancel each other out when 
summed; this cancelling is called phase mixing. 

Suppose there exists some frequency w at which the phase k(w)x — wt has a local 
maximum or minimum with respect to variation of w. In the vicinity of this extremum, 
the phase is independent of frequency and so the contributions from adjacent frequencies 
constructively interfere and produce a finite signal. Thus, an observer located at some 
position x # 0 will hear a signal only at the time when the phase in Eq.(6.83) is at an 
extrema. The phase extrema is found by setting to zero the derivative of the phase with 
respect to frequency, i.e. setting 


Ok 
—r-t=0. (6.84) 
_ Ow 
From Eq.(6.82) it is seen that 
Ok  Wpe 1 
E 6.85 
ðw 2c /u|wWee cos 6| ee) 
so that the time at which a frequency w is heard by an observer at location x is 
a (6.86) 


2c \/wlwWee cos 0| 


This shows that lower frequencies are heard at later times, resulting in the descending tone 
characteristic of whistlers. 


6.6 Group velocity 


Suppose that at time t = 0 the electric field of a particular fast or slow mode is decomposed 
into spatial Fourier modes, each varying as exp(ik - x). The total wave field can then be 
written as 


E(x) = J dkE(k) exp(ik - x) (6.87) 
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where E(k) is the amplitude of the mode with wavenumber k. The dispersion relation 
assigns an w to each k, so that at later times the field evolves as 


E(x,t) = J dkE(k) exp(ik - x — iw(k)t) (6.88) 


where w(k) is given by the dispersion relation. The integration over k may be viewed as a 
summation of rapidly oscillatory waves, each having different rates of phase variation. In 
general, this sum vanishes because the waves add destructively or “phase mix”. However, 
if the waves add constructively, a finite E(x,t) will result. Denoting the phase by 


o(k) =k-x—w(k)t (6.89) 
it is seen that the Fourier components add constructively at extrema (minima or maxima) 
of ¢(k), because in the vicinity of an extrema, the phase is stationary with respect to k, 
that is, the phase does not vary with k. Thus, the trajectory x = x(t) along which E(x, t) 


is finite is the trajectory along which the phase is stationary. At time ¢ the stationary phase 
is the place where 0¢(k) /Ok vanishes which is where 


Ob Ow 
— =x-—t=0. . 
ak Ok (6.90) 
The trajectory of the points of stationary phase is therefore 
x(t) = vgt (6.91) 


where v, = Ow/Ok is called the group velocity. The group velocity is the velocity at 
which a pulse propagates in a dispersive medium and is also the velocity at which energy 
propagates. 

The phase velocity for a one-dimensional system is defined as vp, = w/k.In three 
dimensions this definition can be extended to be vpn = kw /k, i.e. a vector in the direction 
of k but with the magnitude w/k. 

Group and phase velocities are the same only for the special case where w is linearly 
proportional to k, a situation which occurs only if there is no plasma. For example, the 
phase velocity of electromagnetic plasma waves (dispersion w? = We + k?c?) is 


2% Jwe + k22 
at (6.92) 


Vpn =k ke 


which is faster than the speed of light. However, no paradox results because information 
and energy travel at the group velocity, not the phase velocity. The group velocity for this 
wave is evaluated by taking the derivative of the dispersion with respect to k giving 

Ow 


ar k 
cae Š (6.94) 


T Jg ree 
which is less than the speed of light. 

This illustrates an important property of the wave normal surface concept — a wave 
normal surface is a polar plot of the phase velocity and should not be confused with the 
group velocity. 


or 
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6.7 Quasi-electrostatic cold plasma waves 


Another useful way of categorizing waves is according to whether the wave electric field 
is: 


1. electrostatic so that V x E =O and E = — Vo 


or 


2. inductive so that V - E = 0 and in Coulomb gauge, E = —OA/0t, where A is the 
vector potential. 

An electrostatic electric field is produced by net charge density whereas an inductive 
field is produced by time-dependent currents. Inductive electric fields are always associated 
with time-dependent magnetic fields via Faraday’s law. 

Waves involving purely electrostatic electric fields are called electrostatic waves, whereas 
waves involving inductive electric fields are called electromagnetic waves because these 
waves involve both electric and magnetic wave fields. In actuality, electrostatic waves must 
always have some slight inductive component, because there must always be a small cur- 
rent which establishes the net charge density. Thus, strictly speaking, the condition for 
electrostatic modes is V x E ~0 rather than V x E = 0. 

In terms of Fourier modes where V is replaced by ik, electrostatic modes are those 
for which k x E =0 so that E is parallel to k; this means that electrostatic waves are 
longitudinal waves. Electromagnetic waves have k- E = 0 and so are transverse waves. 
Here, we are using the usual wave terminology where longitudinal and transverse refer to 
whether k is parallel or perpendicular to E. 

The electron plasma waves and ion acoustic waves discussed in the previous chapter 
were electrostatic, the compressional Alfvén wave was inductive, and the inertial Alfvén 
wave was both electrostatic and inductive. We now wish to show that in a magnetized 
plasma, the wheel and dumbbell modes in the CMA diagram always have electrostatic 
behavior in the region where the wave normal surface comes close to the origin, i.e., near 
the cross-over in the figure-eight pattern of these wave normal surfaces. For these waves, 
when n becomes large (i.e., near the cross-over of the figure-eight pattern of the wheel 
or dumbbell), n becomes nearly parallel to E and the magnetic part of the wave becomes 
unimportant. We now prove this assertion and also take care to distinguish this situation 
from another situation where n becomes infinite, namely at cyclotron resonances. 

When the two roots of the dispersion Ant — Bn? + C = 0 are well-separated (i.e., 
B? >> 4AC) the slow mode is found by assuming that n? is large. In this case the 
dispersion can be approximated as 4n+ — Bn? ~ 0 which gives the slow mode as n? ~ 
B/A. Resonance (i.e., n? — oo) can thus occur either from 

1. A= Ssin?0 + Pos 26 vanishing, or 


2. B= RLsin?6 + PS(1 + cos?0) becoming infinite. 

These two cases are different. In the first case S and P remain finite and the vanishing 
of A determines a critical angle 0,., = tan~!,/—P/S; this angle is the cross-over angle 
of the figure-eight pattern of the wheel or dumbbell. In the second case either R or L must 
become infinite, a situation occurring only at the R or L bounding surfaces. 

The first case results in quasi-electrostatic cold plasma waves, whereas the second case 
does not. To see this, the electric field is first decomposed into its longitudinal and trans- 
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verse parts 


E = Aan-E 
E = E-F (6.95) 


where ù = k = n /n is a unit vector in the direction of n. The cold plasma wave equation, 
Eq. (6.17) can thus be written as 


nn: (E' + E') — n? (Ef +E!) +K - (E' +E’) =0. (6.96) 
Since n - E’ = 0 and nn - E! = n? F! this expression can be recast as 
TZ 2T tle. F 
(R-n T)-B'+K - E'=0 (6.97) 
= 
where I is the unit tensor. If the resonance is such that 


n? >> Kij (6.98) 


where K;j are the elements of the dielectric tensor, then Eq.(6.97) may be approximated 
as 


Db ee le 
—VE'+K - E00. (6.99) 
This shows that the transverse electric field is 
1 
E' = -—K-E! (6.100) 
n 


which is much smaller in magnitude than the longitudinal electric field by virtue of Eq.(6.98). 
An easy way to obtain the dispersion relation (determinant of this system) is to dot 
Eq.(6.100) with n to obtain 


n- K -n=n?(S'sin20 + Pcos26) ~ 0 (6.101) 


which is just the first case discussed above. This argument is self-consistent because for 
the first case (i.e., A — 0) the quantities S, P, D remain finite so the condition given by 
Eq.(6.98) is satisfied. 

The second case, B — oo, occurs at the cyclotron resonances where S and D diverge 
so the condition given by Eq.(6.98) is not satisfied. Thus, for the second case the electric 
field is not quasi-electrostatic. 


6.8 Resonance cones 


The situation A — O corresponds to Eq.(6.101) which is a dispersion relation having the 
surprising property of depending on 0, but not on the magnitude of n. This limiting form 
of dispersion has some bizarre aspects which will now be examined. 

The group velocity in this situation can be evaluated by writing Eq.(6.101) as 


kS+k2P=0 (6.102) 
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and then taking the vector derivative with respect to k to obtain 


Os OP\ Ow 
s sD 4 2 up = 
2k,tS + 2k, 2P 4 (e ag he r) Ak 0 (6.103) 
which may be solved to give 
Ow kztS + kz2P 
ae 2 one. „ÖP (6.104) 
"Ow 7 Ow 
If Eq.(6.104) is dotted with k the surprising result 
Ow 
k-— =0 6.105 
ak , ( ) 


is obtained which means that the group velocity is orthogonal to the phase velocity. The 
same result may also be obtained in a quicker but more abstract way by using spherical 
coordinates in k space in which case the group velocity is just 


ðw ov l 0 ðw 
ðk ðk Kk 00" 
Applying this to Eq.(6.101), it is seen that the first term on the right hand side vanishes 
because the dispersion relation is independent of the magnitude of k. Thus, the group 
velocity is in the @ direction, so the group and phase velocities are again orthogonal, since 
k is orthogonal to 0. Thus, energy and information propagate at right angles to the phase 
velocity. 
A more physically intuitive interpretation of this phenomenon may be developed by 
“un-Fourier” analyzing the cold plasma wave equation, Eq.(6.17) giving 


(6.106) 


wes 
Vx VxE-GK-E=0. (6.107) 


The modes corresponding to A — 0 were obtained by dotting the dispersion relation with 
n, an operation equivalent to taking the divergence in real space, and then arguing that 
the wave is mainly longitudinal. Let us therefore assume that E ~ —V@ and take the 
divergence of Eq.(6.107) to obtain 


v- (K vo) =0 (6.108) 


which is essentially Poisson’s equation for a medium having dielectric tensor K. Equation 
(6.108) can be expanded to give 


ro ro 

— — =0. 6.109 

Ox? Oz? ( ) 
If S and P have the same sign, Eq.(6.109) is an elliptic partial differential equation and so 


is just a distorted form of Poisson’s equation. In fact, by defining the stretched coordinates 
€=2/,/|S| and 7 = z/,/|P|, Eq.(6.109) becomes Poisson’s equation in € — n space. 
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Suppose now that waves are being excited by a line source q6(x)6(z)exp( —iwt), i.e., a 
wire antenna lying along the y axis oscillating at the frequency w. In this case Eq.(6.109) 
becomes 

e Fh q 
ae? i On? T |S P|3/2£0 


6(§)6(7) (6.110) 


so that the equipotential contours excited by the line source are just static concentric circles 
in €,7 or equivalently, static concentric ellipses in x, z. 

However, if S and P have opposite signs, the situation is entirely different, because 
now the equation is hyperbolic and has the form 


Oo P| 6 
— =!s|=5.- 11 
Ox? | S | Oz? One 
Equation (6.111) is formally analogous to the standard hyperbolic wave equation 
uw ow 
pe SC 6.112 
ae ° ar ae 


which has solutions propagating along the characteristics Y = (z + ct). Thus, the solu- 
tions of Eq.(6.111) also propagate along characteristics, i.e., 


b= (z+ V—P/Sz) (6.113) 


which are characteristics in the x — z plane rather than the x — t plane. For a line source, 
the potential is infinite at the line, and this infinite potential propagates from the source 


following the characteristics 
[>P 
z= © Tats (6.114) 


If the source is a point source, then the potential has the form 


q 

p2- 2 1/2 
4 EN 
ro( 5 +5) 


which diverges on the conical surface having cone angle tan Ocone = r/z = £,\/—S/P 
as shown in Fig.6.4. These singular surfaces are called resonance cones and were first 
observed by Fisher and Gould (1969). The singularity results because the cold plasma ap- 
proximation allows k to be arbitrarily large (i.e., allows infinitesimally short wavelengths). 
However, when k is made larger than w/v, the cold plasma assumption w/k >> vr be- 
comes violated and warm plasma effects need to be taken into account. Thus, instead of 
becoming infinite on the resonance cone, the potential is large and finite and has a fine 
structure determined by thermal effects (Fisher and Gould 1971). 


Wr, z) ~ (6.115) 
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point source ‘infinite’ potential 


/ on this surface 


Figure 6.4: Resonance cone excited by oscillating point source in magnetized plasma. 


Resonance cones exist in the following regions of parameter space 
(i) Region 3, where they are called upper hybrid resonance cones; here S < 0, P > 0, 


(ii) Regions 7 and 8, where they are a limiting form of the whistler wave and are also called 
lower hybrid resonance cones since they are affected by the lower hybrid resonance 
(Bellan and Porkolab 1975). For wei <<w << Wye, Wee the P and S dielectric 
tensor elements become 

2 ie 2 


Whe i e 
Pec T Oea n Eo (6.116) 


so that the cone angle Ocone = tan “ir/z is 


S ‘ 
Doone = tan *4/—— = tan (uw? — we.) (wpe + wee’). (6.117) 


If w >> win, the cone depends mainly on the smaller of wpe, Wee. For example, if 
Wee << Wye then the cone angle is simply 


cone = tan ~w /wee (6.118) 
whereas if Wee >> Wye then 
beone ~ tan~ tw/wpe. (6.119) 


For low density plasmas this last expression can be used as the basis for a simple, 
accurate plasma density diagnostic. 


Gii) Regions 10 and 13. The Alfvén resonance cones in region 13 have a cone angle 
bcone = W/\/|WeeWei| and are associated with the electrostatic limit of inertial 
Alfvén waves (Stasiewicz, Bellan, Chaston, Kletzing, Lysak, Maggs, Pokhotelov, 
Seyler, Shukla, Stenflo, Streltsov and Wahlund 2000). To the best of the author’s 
knowledge, cones have not been investigated in region 10 which corresponds to an 
unusual mix of parameters, namely wpe is the same order of magnitude as wci. 
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6.9 Assignments 


. Prove that the cold plasma dispersion relation can be written as 


Ant- Bn? +C=0 
where 
A = Ssin?0 + P cos?0 
B = (S? — D? ) sin ?0 + SP(1 + cos ?0) 
C = P(S? — D?) 
so that the dispersion is 


> B+tvVB?—44C 
2A 


Prove that 
RL = S? — D?. 


. Prove that n? is always real if @ is real, by showing that 


B? — 4AC = |S? — D? — SP]? sin*0 + 4P?D? cos0. 


. Plot the bounding surfaces of the CMA diagram, by defining m./m; = A, £x = 


(we oF wr?) /w?, and y = w2, /w?. Show that 


We _ Le 

w2 1 +X 
so that 

P=1-2 


and S, R, L are functions of x and y with A as a parameter. Hint- it is easier to plot 
x versus y for some of the functions. 


. Plot n? versus w for 0 = 7/2, showing the hybrid resonances. 


. Starting in region 1 of the CMA diagram, establish the signs of S, P, R, L in all the 


regions. 


. Plot the CMA mode lines for plasmas having We >> w2, and vice versa. 


. Consider a plasma with two ion species.By plotting S versus w show that there is an 


ion-ion hybrid resonance located between the two ion cyclotron frequencies. Give an 
approximate expression for the frequency of this resonance in terms of the ratios of 
the densities of the two ion species. Hint- compare the magnitude of the electron term 
to that of the two ion terms. Using quasineutrality, obtain an expression that depends 
only on the fractional density of each ion species. 


. Consider a two-dimensional plasma with an oscillatory delta function source at the 


origin. Suppose that slow waves are being excited which satisfy the electrostatic dis- 
persion 
kS +KP =0 


10. 


6.9 Assignments 209 


where SP < 0. By writing the source on the z axis as 


1 ik,z—iwt 
ae 1K zZ—1W dk, 
f= 5 fe 
and by solving the dispersion to give ky = k,(k,) show that the potential excited in 
the plasma is singular along the resonance cone surfaces. Explain why this happens. 
Draw the group and phase velocity directions. 


What is the polarization (i.e., relative magnitude of E,,E,,E,) of the QTO, QTX, 
and the two QL modes? How should a microwave horn be oriented (i.e., in which way 
should the E field of the horn point) when being used for (i) a QTO experiment, (ii) a 
QTX experiment. Which experiment would be best suited for heating the plasma and 
which best suited for measuring the density of the plasma? 


Show that there is a simple factoring of the cold plasma dispersion relation in the low 
frequency limit w << wei. Hint - first find approximate forms of S, P, and D in this 
limit and then show that the cold plasma dielectric tensor becomes diagonal. Consider 
a mode which has only Fy finite and a mode which only has Æ, and F, finite. What 
are the dispersion relations for these two modes, expressed in terms of w as a function 
of k. Assume that the Alfvén velocity is much smaller than the speed of light. 


Waves in inhomogeneous plasmas and wave 
energy relations 


7.1 Wave propagation in inhomogeneous plasmas 


Thus far in our discussion of wave propagation it has been assumed that the plasma is spa- 
tially uniform. While this assumption simplifies analysis, the real world is usually not so 
accommodating and it is plausible that spatial nonuniformity might modify wave propaga- 
tion. The modification could be just a minor adjustment or it could be profound. Spatial 
nonuniformity might even produce entirely new kinds of waves. As will be seen, all these 
possibilities can occur. 

To determine the effects of spatial nonuniformity, it is necessary to re-examine the orig- 
inal system of partial differential equations from which the wave dispersion relation was 
obtained. This is because the technique of substituting ik for V is, in essence, a shortcut 
for spatial Fourier analysis, and so is mathematically valid only if the equilibrium is spa- 
tially uniform. This constraint on replacing V by ik can be appreciated by considering 
the simple example of a high-frequency electromagnetic plasma wave propagating in an 
unmagnetized three-dimensional plasma having a gentle density gradient. The plasma fre- 
quency will be a function of position for this situation. To keep matters simple, the density 
non-uniformity is assumed to be in one direction only which will be labeled the x direction. 
The plasma is thus uniform in the y and z directions, but non-uniform in the x direction. 
Because the frequency is high, ion motion may be neglected and the electron motion is 
just 


v=- (7.1) 
iwMe 
The current density associated with electron motion is therefore 
2 w(x 
7,2 MOG p a (7.2) 
iwMe iw 
Inserting this current density into Ampere’s law gives 
2 3 : 2 
w l l we. (x 
YxB = pel on sg eee Ba pelz) E1. (1.3) 
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Substituting Ampere’s law into the curl of Faraday’s law gives 


w2 2 

vx vm =$ (1-0) m (1.4) 
c w 

Attention is now restricted to waves for which V - E,; = 0; this is a generalization of 

the assumption that the waves are transverse (i.e., have ik- E = 0) or equivalently are 

electromagnetic, and so involve no density perturbation. In this case, expansion of the left 

hand side of Eq.(7.4) yields 


OF nO 350 w? w2 (2) 
(i EA oa) Bi oe (: e |E =0. (1.5) 


It should be recalled that Fourier analysis is restricted to equations with constant coeffi- 
cients, so Eq.(7.5) can only be Fourier transformed in the y and z directions. It cannot be 
Fourier transformed in the x direction because the coefficient Ge a(t) depends on x. Thus, 
after performing only the allowed Fourier transforms, the wave equation becomes 


o? = w? wpelt) \ = 
(su - 4-82) Ey (x, ky, kz) 4 2 (: 2 E; (z, ky,k:-)=0 (7.6) 


Ox? 


where Ey (x, ky, kz) is the Fourier transform in the y and z directions. This may be rewrit- 
ten as 


32 ' Z 
> + (e)) E(x, ky, kz) =0 T 
where 
2 2 
ary Y Woe (@) 2 2 
k?(z) = i 1 Ae k? — kz. (7.8) 


We now realize that Eq. (7.7) is just the spatial analog of the WKB equation for a pendulum 
with slowly varying frequency, namely Eq.(3.17); the only difference is that the indepen- 
dent variable t has been replaced by the independent variable x. Since changing the name 
of the independent variable is of no consequence, the solution here is formally the same as 
the previously derived WKB solution, Eq.(3.24). Thus the approximate solution to Eq.(7.8) 
is 


E; (x, ky, kz) ed 


x 

: epli f k(x')dz’). (7.9) 
V(x) 
Equation (7.9) shows that both the the wave amplitude and effective wavenumber change 
as the wave propagates in the x direction, i.e. in the direction of the inhomogeneity. It 
is Clear that if the inhomogeneity is in the x direction, the wavenumbers ky and k, do not 
change as the wave propagates. This is because, unlike for the x direction, it was possible 
to Fourier transform in the y and z directions and so ky and k, are just coordinates in 
Fourier space. The effective wavenumber in the direction of the inhomogeneity, i.e., k(x), 
is not a coordinate in Fourier space because Fourier transformation was not allowed in the 
x direction. The spatial dependence of the effective wavenumber «(x) defined by Eq.(7.8) 
and the spatial dependence of the WKB amplitude together provide the means by which 
the system accommodates the spatial inhomogeneity. The invariance of the wavenumbers 
in the homogeneous directions is called Snell’s law. An elementary example of Snell’s 
law is the situation where light crosses an interface between two media having different 
dielectric constants and the refractive index parallel to the interface remains invariant. 
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One way of interpreting this result is to state that the WKB method gives qualified 
permission to Fourier analyze in the x direction. To the extent that such an x-direction 
Fourier analysis is allowed, « (x) can be considered as the effective wavenumber in the x 
direction, i.e., K(x) = k(x). The results in Chapter 3 imply that the WKB approximate 
solution, Eq.(7.9), is valid only when the criterion 


1 dkz 

ky dx 
is satisfied. Inequality (7.10) is not satisfied when kz, — 0, i.e., at a cutoff. At a resonance 
the situation is somewhat more complicated. According to cold plasma theory, k, simply 
diverges at a resonance; however when hot plasma effects are taken into account, it is found 
that instead of having k, going to infinity, the resonant cold plasma mode coalesces with 
a hot plasma mode as shown in Fig.7.1. At the point of coalescence dk, /dx — oo while 
all the other terms in Eq.(7.9) remain finite, and so the WKB method also breaks down at a 
resonance. 

An interesting and important consequence of this discussion is the very real possibility 
that inequality (7.10) could be violated in a plasma having only the mildest of inhomo- 
geneities. This breakdown of WKB in an apparently benign situation occurs because the 
critical issue is how k,,(x) changes and not how plasma parameters change. For example, 
k, could go through zero at some critical plasma density and, no matter how gentle the 
density gradient is, there will invariably be a cutoff at the critical density. 


<< ky (7.10) 


hot plasma 
wave | 


| cold plasma 
wave resonance 


cold plasma wave 
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Figure 7.1: Example of coalescence of a cold plasma wave and a hot plasma wave near the 
resonance of the cold plasma wave. Here a hybrid resonance causes the cold resonance. 
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7.2 Geometric optics 


The WKB method can be generalized to a plasma that is inhomogeneous in more than 
one dimension. In the general case of inhomogeneity in all three dimensions, the three 
components of the wavenumber will be functions of position, i.e., k = k(x). How is the 
functional dependence determined? The answer is to write the dispersion relation as 


D(k,x) =0. (7.11) 
The x-dependence of D denotes an explicit spatial dependence of the dispersion relation 
due to density or magnetic field gradients. This dispersion relation is now presumed to 
be satisfied at some initial point x and then it is further assumed that all quantities evolve 


in such a way to keep the dispersion relation satisfied at other positions. Thus, at some 
arbitrary nearby position x+06x, the dispersion relation is also satisfied so 


D(k+6k, x+6x) = 0 (7.12) 
or, on Taylor expanding, 
oD oD 
Ok: tox = 0. (7.13) 


The general condition for satisfying Eq.(7.13) can be established by assuming that both k 
and x depend on some parameter which increases monotonically along the trajectory of 
the wave, for example the distance s along the wave trajectory. The wave trajectory itself 
can also be parametrized as a function of s. Then using both k = k(s) and x = x(s), it is 
seen that moving a distance 6s corresponds to respective increments 6k = ôs dk/ds and 
ôx = ôs dx/ds. This means that Eq. (7.13) can be expressed as 


dk OD dx OD 
ds Ok 'ds 0x 


The general solution to this equation are the two coupled equations 


6s =0. (7.14) 


dk oD 

a, ES 7.15 
ds Ox’ et) 
dx oD 

= = Tal 
ds Ok 216) 


These are just Hamilton’s equations with the dispersion relation D acting as the Hamil- 
tonian, the path length s acting like the time, x acting as the position, and k acting as the 
momentum. Thus, given the initial momentum at an initial position, the wavenumber evo- 
lution and wave trajectory can be calculated using Eqs.(7.15) and (7.16) respectively. The 
close relationship between wavenumber and momentum fundamental to quantum mechan- 
ics is plainly evident here. Snell’s law states that the wavenumber in a particular direction 
remains invariant if the medium is uniform in that direction; this is clearly equivalent (cf. 
Eq.(7.15)) to the Hamilton-Lagrange result that the canonical momentum in a particular 
direction is invariant if the system is uniform in that direction. 

This Hamiltonian point of view provides a useful way for interpreting cutoffs and reso- 
nances. Suppose that D is the dispersion relation for a particular mode and suppose that D 
can be written in the form 


D(k,x) = So aijkik; + g(w, n(x), B(x)) = 0. (7.17) 


ij 
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If D is construed to be the Hamiltonian, then the first term in Eq.(7.17) can be identified as 
the ‘kinetic energy’ while the second term can be identified as the ‘potential energy’. As an 
example, consider the simple case of an electromagnetic mode in an unmagnetized plasma 
which has nonuniform density, so that 

CK? whe(a) 


14 


D(k,x) = 2 2 


—0. (7.18) 


The wave propagation can be analyzed in analogy to the problem of a particle in a poten- 
tial well. Here the kinetic energy is c?k? /w?, the potential energy is —1 + w?.(a)/w?, and 
the total energy is zero. This is a ‘wave-particle’ duality formally like that of quantum me- 
chanics since there is a correspondence between wavenumber and momentum and between 
energy and frequency. Cutoffs give wave reflection in analogy to the reflection of a particle 
in a potential well at points where the potential energy equals the total energy. As shown 
in Fig.7.2(a), when the potential energy has a local minimum, waves will be trapped in 
the potential well associated with this minimum. Electrostatic plasma waves can also ex- 
hibit wave trapping between two reflection points; these trapped waves are called cavitons 
(the analysis is essentially the same: one simply replaces c? by Bw? Ae): The bouncing 
of short wave radio waves from the ionospheric plasma can be analyzed using Eq.(7.18) to- 
gether with Eqs.(7.15) and (7.16). As shown in Fig.7.2(b) a wave resonance (i.e., k? — 00) 
would correspond to a deep crevice in the potential energy. One must be careful to use geo- 
metric optics only when the plasma is weakly inhomogeneous, so that the waves change 
sufficiently gradually as to satisfy the WKB criterion. 


(a) (b) 


‘ ; > 
total energy potential energy total energy 


l a 


‘potential energy’ 


t 


‘kinetic energy’ 


wave 


a resonance 


Figure 7.2: (a) Effective potential energy for trapped wave; (b) for wave resonance. 


7.3 Surface waves - the plasma-filled waveguide 


An extreme form of plasma inhomogeneity occurs when there is an abrupt transition from 
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plasma to vacuum — in other words, the plasma has an edge or surface. A qualitatively new 
mode, called a surface wave, appears in this circumstance. The physical basis of surface 
waves is closely related to the mechanism by which light waves propagate in an optical 
fiber. 

Using the same analysis that led to Eq.(7.3), Maxwell’s equations in an unmagnetized 
plasma may be expressed as 


VxB=-=PE, Vx E=iwB (7.19) 
c 
where P is the unmagnetized plasma dielectric function 
ae 
P=1- A (7.20) 
w 


We consider a plasma which is uniform in the z direction but non-uniform in the direc- 
tions perpendicular to z. The electromagnetic fields and gradient operator can be separated 
into axial components (i.e. z direction) and transverse components (i.e., perpendicular to 
z) as follows: 


a 
B=B,+B.2, E=E,+E.2, V=V, +2% (7.21) 
Ze 


Using these definitions Eqs.(7.19) become 


_O : iw 5 
(v: + =) x (B: + B,2) = -aP (E: + E,2), 


o 
(v: + =) x (E: + E,2) = lw (By + B,2) 3 
z 


Since the curl of a transverse vector is in the z direction, these equations can be separated 
into axial and transverse components, 


(7.22) 


2- Vix Bi = -5 PE, (1.23) 
2- Vi x E; = iwB,, (1.24) 
ƏB i 
2x Lt L VB; x ê = -E PF, (1.25) 
Oz c2 
dE 
2x ae + VE, x 2 = iwBy. (7.26) 


z 
The transverse electric field on the left hand side of Eq.(7.26) can be replaced using Eq.(7.25) 
to give 


OB 
ð Zx ae + VB, x Z 
. — 3s Zz | a 
iwB; = 2 x 5z 5 HVE; x Z. (7.27) 
a 
It is now assumed that all quantities have axial dependence ~ exp(ikz) so that Eq.(7.27) 


can be solved to give B; solely in terms of E, and B,, i.e., 


2 -1 OB oF 
B= Pee Vi — H PV,E, x ĉl. (7.28) 
Cc Oz Cc 
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This result can also be used to solve for the transverse electric field by interchanging 
—iwP/c? ——> iw and E — B to obtain 


2 “T_ ðE 
E; = (GP es r) ea + iwV,B, x 2| . (7.29) 
C Oz 


Except for the plasma-dependent factor P, these are the standard waveguide equations. An 
important feature of these equations is that the transverse fields E;, B+ are functions of the 
axial fields E,, B, only and so all that is required is to construct wave equations charac- 
terizing the axial fields. This is an enormous simplification because, instead of having to 
derive and solve six wave equations in the six components of E, B as might be expected, it 
suffices to derive and solve wave equations for just £, and B,. 
The sought-after wave equations are determined by eliminating E; and B; from Eqs.(7.23) 

and (7.24) to obtain 


2 —1 š 3 
2- Vix [($r- #) fever. oes PN Bo a ) =- PE, (130) 
C C E 


2 -1 
2- Vi x { (eP- r) ikV BE. + iwV:B; x 4] = iwB,. (1.31) 


In the special situation where V;P x V,B, = ViP x VE, = 0, the first terms in the 
square brackets of the above equations vanish. This simplification would occur for example 
in an azimuthally symmetric plasma having an azimuthally symmetric perturbation so that 
VP, V+E, and V;B, are all in the r direction. It is now assumed that both the plasma 
and the mode have this azimuthal symmetry so that Eqs.(7.30) and (7.31) reduce to 


2 -1 
2- Vix [(žr- r) (PVE, x a} = PE,, (7.32) 
c 

w? = 

2- Vix (GP- r) (V:B: x2) >= B, (7.33) 
or equivalently 
P w? 
Vi $ raea + a rE = 0, (7.34) 
V M ee B + WB =0 (7.35) 
z P= kju |? cee Í 


The assumption that both the plasma and the modes are azimuthally symmetric has 
the important consequence of decoupling the Æ, and B, modes so there are two distinct 
polarizations. These are (i) a mode where B, is finite, but Æ, = O and (ii) the reverse. Case 
(1) is called a transverse electric (TE) mode while case (ii) is called a transverse magnetic 
mode (TM) since in the first case the electric field is purely transverse while in the second 
case the magnetic field is purely transverse. 
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We now consider an azimuthally symmetric TM mode propagating in a uniform cylin- 
drical plasma of radius a surrounded by vacuum. Since B, = O for a TM mode, the 
transverse fields are the following functions of F,: 


19, -1 š 

B; = (GP- r) |-gPvie. x d ; (7.36) 
E C 
w? TI OB 

fe = (S PS i) Te (7.37) 
c Oz 


Additionally, because of the assumed symmetry, the TM mode Eq.(7.34) simplifies to 


10 rP OE, w? 
r Or \ P- keju? Or l 


PE, =0. (7.38) 


Since P is uniform within the plasma region and within the vacuum region, but has different 
values in these two regions, separate solutions to Eq.(7.38) must be obtained in the plasma 
and vacuum regions respectively and then matched at the interface. The jump in P is 
accommodated by defining distinct radial wave numbers 


o= B-SP, (7.39) 
a) 
K = Ro (7.40) 


for the respective plasma and vacuum regions. The solutions to Eq.(7.38) in the respective 
plasma and vacuum regions are linear combinations of Bessel functions of order zero. If 
both of Ke and «2 are positive then the TM mode has the peculiar property of being radially 
evanescent in both the plasma and vacuum regions. In this case both the vacuum and 
plasma region solutions consist of modified Bessel functions Jo, Ko. These solutions are 
constrained by physics considerations as follows: 

1. Because the parallel electric field is a physical quantity it must be finite. In particular, 
E, must be finite at r = 0 in which case only the Tọ(xpr) solution is allowed in the 
plasma region (the Ko solution diverges at r = 0). Similarly, because Æ, must be 
finite as r — oo, only the Ko(k,r) solution is allowed in the vacuum region (the 
Io(Kyr) solution diverges at r = 00). 


2. The parallel electric field Æ, must be continuous across the vacuum-plasma interface. 
This constraint is imposed by Faraday’s law and can be seen by integrating Faraday’s 
law over an area in the r — z plane of axial length L and infinitesimal radial width. 
The inner radius of this area is at r_ and the outer radius is at r} where r- < a < r4. 
Integrating Faraday’s law over this area gives 


PoP Pon ot 


lim ds-V x E = f Bl =— lim ds: -— 


or 
ELL = plasma p =0 
zZ z = 
showing that Æ, must be continuous at the plasma-vacuum interface. 
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3. Integration of Eq.(7.38) across the interface shows that the quantity 


P(P — kc? /w?)”" OE, /Or 


must be continuous across the interface. 
In order to satisfy constraint #1 the parallel electric field in the plasma must be 


I 
B,(r) = B, (a) en (7.41) 
Io(Kpa) 
and the parallel electric field in the vacuum must be 
Ko(Kur) 
E,(r) = E,(a)————.. 7.42 
(r) (a) T (7.42) 


The normalization has been set so that Æ, is continuous across the interface as required by 


constraint #2. 
w? -I OR 
su p kE ogee ; 7.43 
( c2 ) Or ( 
aa _ r=ar 


Constraint #3 gives 
w? = E 
ce 7 r) pos 
c 
Inserting Eqs. (7.41) and (7.42) into the respective left and right hand sides of the above 
expression gives 


Or 
w? 5 a Kplg(Kpa) w? 5 =i Ky K6(Kya) 
(Gr-#) P =(5 r) : (7.44) 

c Ip(Kpa) c Ko(Kya) 
where a prime means a derivative with respect to the argument of the function. This expres- 
sion is effectively a dispersion relation since it prescribes a functional relationship between 
w and k. It is qualitatively different from the previously discussed uniform plasma disper- 
sion relations, because of the dependence on the plasma radius a, a physical dimension. 
This dependence indicates that this mode requires the existence of the plasma-vacuum in- 
terface. The mode amplitude is strongest in the vicinity of the interface because both the 
plasma and vacuum fields decay exponentially on moving away from the interface. 

The surface wave dispersion depends on a combination of Bessel functions and the par- 
allel dielectric P. However, a limit exists for which the dispersion relation reduces to a 
simpler form, and this limit illustrates important features of these surface waves. Specifi- 
cally, if the axial wavelength is sufficiently short for k? to be much larger than both w? P/c? 
and w/c? then it is possible to approximate k? ~ K? ~ Ke so that the dispersion simplifies 
to 


Io(ka) _ K6(ka) 

Tolka) ~~ Kolka)” 
If, in addition, the axial wavelength is sufficiently long to satisfy ka << 1, then the small- 
argument limits of the modified Bessel functions can be used, namely, 


(7.45) 


2 
lim 106) = 145, (7.46) 
lim Ko(€) = —Iné. (7.47) 


E-0 
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Thus, in the limit w? P/c?, w?/c? << k? << 1/a?, Eq.(7.45) simplifies to 


w2.\ ka 1 
1 p pa , 7.48 
( w? | 2 ~ kaln(ka) a) 
Because ka << 1, the logarithmic term is negative. Hence, to satisfy Eq.(7.48) it is neces- 
sary to have w << wpe so that the dispersion further becomes 


eee ee (x). (7.49) 
a 


Whe 2 


On the other hand, if ka >> 1, then the large argument limit of the Bessel functions 
can be used, namely, 
I(E) =, Ko(é)=e* (7.50) 
so that the dispersion relation becomes 


1-2 =-1 (7.51) 


or 


(7.52) 


This provides the curious result that a finite-radius plasma cylinder resonates at a lower 
frequency than a uniform plasma if the axial wavelength is much shorter than the cylinder 
radius. 

These surface waves are slow waves since w/k << c has been assumed. They were 
first studied by Trivelpiece and Gould (1959) and are seen in cylindrical plasmas sur- 
rounded by vacuum. For ka << 1 the phase velocity is w/k ~ O(wpea) and for ka >> 1 
the phase velocity goes to zero since w is a constant at large ka. More complicated varia- 
tions of the surface wave dispersion are obtained if the vacuum region is of finite extent and 
is surrounded by a conducting wall, i.e., if there is plasma for r < a, vacuum fora < r < b 
and a conducting wall at r = b. In this case the vacuum region solution consists of a linear 
combination of Jg9(K,7) and Ko(k,r) terms with coefficients chosen to satisfy constraints 
#2 and #3 discussed earlier and also a new, additional constraint that Æ, must vanish at the 
wall, i.e., at r = b. 


7.4 Plasma wave-energy equation 


The energy associated with a plasma wave is related in a subtle way to the dispersive prop- 
erties of the wave. Quantifying this relation requires starting from first principles regarding 
the electromagnetic field energy density and taking into account specific features of dis- 
persive waves. The basic equation characterizing electromagnetic energy density, called 
Poynting’s theorem, is obtained by subtracting B dotted with Faraday’s law from E dotted 
with Ampere’s law, 


OB 
Ot 


OE 
E-VxB-BVxE= B: (oJ+coro 5 ) HB- 
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and expressing this result as 


Ow 
ay P50 (7.53) 
wae a dE 1. oB 
wW 
— =E-J+eoE-—+—B. 7.54 
at co Ot ig Ot U2) 
and HxB 
poA (7.55) 


Ho 
is called the Poynting flux. The quantities P and ôw/ðt are respectively interpreted as the 
electromagnetic energy flux into the system and the rate of change of energy density of the 
system. The energy density is obtained by time integration and is 


(t) (to) [ate piece BT } 
w = w + -J +E0 E: } > 
i to ao a Mg Ot 
t B? t 
= w(to) +f dtE- J+ Fe] (1.56) 
to 2 2Ho to 


where w(to) is the energy density at some reference time to. 
The quantity E - J is the rate of change of kinetic energy density of the particles. This 
can be seen by first dotting the Lorentz equation with v to obtain 


d 
mv - < = qv- (E +v x B) (1.57) 


or 
Elg 
— | -—mv* | = qE - v. (7.58) 


Since this is the rate of change of kinetic energy of a single particle, the rate of change of 
the kinetic energy density of all the particles, found by summing over all the particles, is 


d 
— (kinetic energy density) = dv folo E- v 
dt 
T 5 No qoE ` Uc 
= E-J. (7.59) 


This shows that positive E - J corresponds to work going into the particles (increase 
of particle kinetic energy) whereas negative E- J corresponds to work coming out of the 
particles (decrease of the particle kinetic energy). The latter situation is obviously possible 
only if the particles start with a finite initial kinetic energy. Since E - J accounts for changes 
in the particle kinetic energy density, w must be the sum of the electromagnetic field density 
and the particle kinetic energy density. 

The time integration of Eq.(7.59) must be done with great care if E and J are wave 
fields. This is because writing a wave field as Y = wo exp(ik - x — iwt) must always be un- 
derstood as a notational convenience which should never be taken to mean that the actual 
physical wave field is complex. The physical wave field is always real and so it is always 


understood that the physically meaningful variable is Y = Re [ý exp(ik - x — iwt)| . This 
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taking of the real part is often not explicitly stated in linear relationships where its omission 
does not affect the mathematical logic. However, taking the real part is a critical step in 
nonlinear relationships, because for nonlinear relationships omitting this step causes seri- 
ous errors. In particular, when dealing with a product of two oscillating physical quantities, 
say w(t) = Re [be 
as 


ne 


and x(t) = Re [xet] , it is essential to write the product 


w(t)x(t) = Re [pe] x Re [xe "] : (7.60) 


that is the real part of the factors must always be established before calculating the product. 
If w = wr + iw; is a complex frequency, then the product in Eq.(7.60) assumes the form 


—2iwt * ~ x Diw*t 


pe >t 4 Xe 
+ý%*e -i(w-w “ergy ye eu) | Š (7.61) 


ha~ iwt a iw* t —iwt * iw Me 
| ye t ye xe +e 
vOxt) = (: (oe) 
1 
4 


When considering the energy density of a wave, we are typically interested in time- 
average quantities, not rapidly fluctuating quantities. Thus the time average of the product 
w(t)x(t) over one wave period will be considered. The wy and Y %* terms oscillate at 
the fast second harmonic of the frequency and vanish upon time-averaging. In contrast, the 
UX" and wx terms survive time-averaging because these terms have no oscillatory factor 


since w — w* = 2iw;. Thus, the time average (denoted by ()) of the product is 


(WO) = 7x +d ne" 


1 r 
= 5Re (ax) erwit, (7.62) 


this is the desired rule for time-averaging products of oscillating quantities. 


7.5 Cold-plasma wave energy equation 
The current density J in Ampere’s law consists of the explicit plasma currents which are 
frequency-dependent. This frequency dependence means that care is required when inte- 


grating E - J. In order to arrange for this integration we express Eq.(6.9) and Eq.(6.10) 
as 


T iw o mn iw 
= Jette (Be ) (7.63) 


where J consists of the plasma currents and the time dependence is shown explicitly. 


222 Chapter 7.. Waves in inhomogeneous plasmas and wave energy relations 


Integration of Eq.(7.56) taking into account the prescription given by Eq.(7.60) has the 
form 


t E(x,t)- (1% pig ch 
w(t) = w(—oo)+ af a 
L +B 1) PBs t) 
Be-iot.eg 2 (R Fe- iwt E 
= w(—00) +5 n af a A ) es 
Ho 


where c.c. means complex conjugate. The term containing the rates of change of the 
electric field and the particle kinetic energy can be written as 


OE = i re 
(E (3) ) = B {Bert (wR Bre") tec} 


= 2f iw* K*-E* — B* wK Bh east 
= = = a E 
z iw, |E- K*-E*-E*. K.-E 
0 2wit 
Sab doa Bi EE Nets ae oa er 
4 | +u,/E-K*-E*+E*-K.- 
(7.65) 
To proceed further it is noted that 
B. K'E’ =X Ë, K} Ë} = xt Kit =§*-K'-B (7.66) 


where the superscript t means transpose and the dagger superscript + means Hermitian 


conjugate, i.e., the complex conjugate of the transpose. Thus, Eq.(7.65) can be re-written 
as 


OE = 1 z 2 i _ 
(£ (a+ =) == [io E" (K-K) - E+w;E* - (K'+K) -Ë| ere, 


(7.67) 
Both the Hermitian part of the dielectric tensor, 
=> 1 /e SS 
K,=5(K+K’), (7.68) 
and the anti-Hermitian part, 
=> lo Ss, 
Kin=5(K-K’), (7.69) 


occur in Eq. (7.67). The cold plasma dielectric tensor is a function of w via the functions 
S, P, and D, 
S(w) —iD(w) 0 
= 
K(w)= | iDw) S(w) 0 : (7.70) 
0 0 P(w) 
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If w; = O then S, P, and D are all pure real. In this case Kw) is Hermitian so that 


Ke 
K, = K and Kan = 0. However, if w; is finite but small, then K (w) will have a small 
non-Hermitian part. This non-Hermitian part is extracted using a Taylor expansion in terms 
of w;, i.e., 


iy a Oes 
K (w, + iwi) = K (wr) + iwi E K | : (7.71) 
Ww w=Wwr 
The transpose of the complex conjugate of this expansion is 
P ə 
R (w, +iwi)| = K (w) — iw; Fase) (7.72) 
Ww WE 


since K is non-Hermitian only to the extent that w; is finite. Substituting Eqs.(7.71) and 
(7.72) into Eqs.(7.68) and (7.69) and assuming small w; gives 


<> ed 
K, = K(w,) (7.73) 
and 
a . On 
Kan = iwi E K (w) =, (7.74) 
Inserting Eqs. (7.73) and (7.74) in Eq.(7.67) yields 
OE ral 2EQW; .|°9 - «a, -| out 
(£ (3+ =) = A rE ao] E + E*- K (w,)-E] e 
= 2E0wi * ð Mao2wit 
= 1 E eR (| _ Ee’. 
(7.75) 
Similarly, the rate of change of the magnetic energy density is 
1 _ B 1 -, = 
B. ) >F [2wiB* -B| nt, (1.16) 
Ho ôt dho 


Using Eqs.(7.75) and (7.76) in Eq.(7.64) gives 


w = w(—00) 4 E (-Kw)| bat al} f dt jet 


—co 
(7.77) 
which now may be integrated in time to give the total energy density associated with bring- 
ing the wave into existence 


w = w—w(-oo) 
econ, [9/7 o E E a eel ee 
= {ope A K (. ) -Ë z |B’ -B| wit (7.78 
fs Fae Le. + et (7.78) 
In the limit w; — 0 this reduces to 
p= OB [WR ) Z+ 20 (7.79) 
w=] zg PKO o ; 
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Since the energy density stored in the vacuum electric field is 


E 2 
jee (7.80) 
4 
and the energy density stored in the vacuum magnetic field is 
Bl? 
wg = el (7.81) 
4ko 
the change in particle kinetic energy density associated with bringing the wave into exis- 
tence is 5 
= £0 = y a> wo = 
part = PË. E (oR (w)) -TÈ (1.82) 


Although this result has been established for the general case of the dielectric ten- 


= 
sor K (w) of a cold magnetized plasma, in order to appreciate its meaning it is use- 
ful to consider the simple example of high frequency electrostatic oscillations in an un- 
magnetized plasma. In this simplest case S = P = 1 — wee /w? and D = 0 so that 


1 2 Dae as : : : 
K (w) = (1- Wre/W ) I. Since the oscillations are electrostatic, wg = 0. The energy 
density of the particles is therefore 


at pe col E|? o 3 wee 
Wpart = 1 a w Pom -1 


ee 
— eol” h- (7.83) 
- = 
_ e0|EP 
4 


where the dispersion relation 1 — we /w* = Ohas been used. Thus, for this simple case, 
half of the average wave energy density is contained in the electric field while the other half 
is contained in the coherent particle motion associated with the wave. 


7.6 Finite-temperature plasma wave energy equation 


The dielectric tensor does not depend on the wavevector k in a cold plasma, but does 
in a finite temperature plasma. For example, the electrostatic unmagnetized cold plasma 
dielectric P(w) = 1 — w?,/w? becomes P(w, k) = 1 — (1+ Bk Aj. )wee/w? in a warm 
plasma. The analysis of the previous section will now be generalized to allow for the 
possibility that the dielectric tensor depends on k as well as on w. In analogy to the method 
used in the previous section for treating complex w, here k will also be assumed to have 
a small imaginary part. In this case, Taylor expansion of the dielectric tensor and then 
extracting the anti-Hermitian part shows that the anti-Hermitian part is 

OL 


+ ik; - ARo 


a> N 0 
Kan = wi É ak 


K (w,k 
Ow (w, k) 


(7.84) 


w=w,,k=k,. | w=w,,k=k, 
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while the Hermitian part remains the same. There is now a new term involving k;.With the 
incorporation of this new term, Eq.(7.75) becomes 


2 Wi = (6) = 
SON He LYK (w) E+ 
oE 4 ae w=w 2k; x+2 
E- STO a, = baie Aas 7 LA ereat 
OR. |Qwk; - —K (w,k -Ë 
4 [2 ag | 


(7.85) 
where we have explicitly written the exponential space-dependent factor exp (—2k; - x) . 
What is the meaning of this new term involving k;? The answer to this question may be 
found by examining the Poynting flux for the situation where k; is finite. Using the product 
rule to allow for finite k; shows that 


(V-P) = (V-(ExH)) 
1 £ Ae oe 
ere [(E xH+Ex H’) giaa] 
1 z Ay E 
=; (-2k, (E xH+Ex ft’)) en 2k xt wit, (7.86) 


Comparison of Eqs.(7.85) and (7.86) show that the factor —2k;- acts as a divergence and 
so the second term in Eq.(7.85) represents an energy flux. Since the Poynting vector P is 
the energy flux associated with the electromagnetic field, this additional energy flux must 
be identified as the energy flux associated with particle motion due to the wave. Defining 
this flux as T it is seen that 


Tj= -—E" - Fas) -È (7.87) 


in the limit k; — 0. For small but finite k;, w; the generalized Poynting theorem can be 
written as 
—2k; : (P + T) + 2w;(wg + wg + Ūpart) = 0. (7.88) 
We now define the generalized group velocity v, to be the velocity with which wave 
energy is transported. This velocity is the total energy flux divided by the total energy 
density, i.e., 
P+T 
WE + we + Wpart f 
The bar in the particle energy represents the fact that this is the difference between the 
particle energy with the wave and the particle energy without the wave and it should be 
recalled that this difference can be negative. 


= (7.89) 


7.7 Negative energy waves 


A curious consequence of this analysis is that a wave can have a negative energy density. 
While the field energy densities wg and wg are positive definite, the particle energy den- 
sity Wpart Can have either sign and in certain circumstances can be sufficiently negative 
to make the total wave energy density negative. This surprising possibility can occur be- 
cause, as was shown in Eq.(7.78), the wave energy density is actually the change in the 
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total system energy density in going from a situation where there is no wave to a situation 
where there is a wave. Typically, negative energy waves occur when the equilibrium has a 
steady-state flow velocity and there exists a mode which causes the particles to develop a 
slower mean velocity than in steady state. Wave growth taps free energy from the flow. 

As an example of a negative energy wave, we consider the situation where unmagne- 
tized cold electrons stream with velocity vo through a background of infinitely massive 
ions. As shown earlier, the electrostatic dispersion for this simple 1D situation with flow 
involves a parallel dielectric involving a Doppler shifted frequency, i.e., the dispersion re- 


lation is 
2 


w e 
P(w,k) = 1—-——®*_,, =0. (7.90) 
(w — kvo) 
{> ad 
Since the plasma is unmagnetized, its dielectric tensor is simply K (w,k) = P(w,k) I. 
Using Eq.(7.79), the wave energy density is 


o| E|? ð eol E|? wwe 
w = — (wP(w,k)) = : (7.91) 
4 Ow ( ( )) 2 (w ees kvo)? 
However, the dispersion relation, Eq.(7.90), shows that 
w = kvo £ Wye (7.92) 
so that Eq.(7.91) can be recast as 
E|? k 
ga Ml (1 + ae (7.93) 
2 Upe 


Thus, if kvo > wpe and the minus sign is selected, the wave has negative energy density. 

This result can be verified by direct calculation of the change in system energy density 
due to growth of the wave. When there is no wave, the electric field is zero and the system 
energy density w*¥* is simply the beam kinetic energy density 


1 
wo = groeve: (1.94) 


Now consider a one dimensional electrostatic wave with electric field E=Re (Belk te) : 


The system average energy density with this wave is 


E|? 1 
T = “all | + (5r(ast)meo(e.t)) (7.95) 


so that the change in system energy density due to the wave is 


ays tE El? 1 ‘ 
= Wanye — Wo” = a : , G [no H n(x, t)] me [vo +(x, t)? 


mss 


l 2 
— govo 


(1.96) 
where 
n1(x,t) = Re (feiet), v1 (x,t) = Re (čet iet) . (1.97) 
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Since odd powers of oscillating quantities vanish upon time averaging, Eq.(7.96) becomes 


E|? 1 i 
ge? ea. IE t | + NoMe Ge + Tuwo) 
0 
E0 E 2 1 
= | + gnome (vt) + Mevo (nv). (7.98) 


The linearized continuity equation gives 


—i(w — kvo)ñ + noikt = 0 (7.99) 
or 7 ae 
Leen ee (7.100) 
no w — kvo 
The fluid quiver velocity in the wave is 
$ qË 
= 7.101 
£ —i(w — kvo)me ( ) 
so that Si 
1 gE 
2 
=. 7.102 
(v1) 2 (w — kvo)? m2 pudga) 
and mae 
k nog E 
aU pier 7.103 
(ram) 2 (w — kvo)3m2 ( ) 
We may now evaluate Eq.(7.98) to obtain 
oY = E0 E/? TEN eE? i kvo eE? 
4 e | 4(w — kup)?2m2 © 2 (w= kvo)3m? 
= & E|? : we Qu)? ekvo 
= 4 (w — kvo)? * (w — kvo)? 
E|? k 
E E i + | (7.104) 
2 Wye 


where Eq.(7.92) has been invoked repeatedly. This is the same as Eq. (7.93). The energy 
flux associated with this wave shows is also negative (cf. assignments). However, the group 
velocity is positive (cf. assignments) because the group velocity is the ratio of a negative 
energy flux to a negative energy density. 

Dissipation acts on negative energy waves in a manner opposite to the way it acts 
on positive energy waves. This can be seen by Taylor-expanding the dispersion relation 
P(w, k) = 0 as done in Eq.(5.83) 

P; 


Expanding Eq.(7.91) gives 
cowl E|? OP 
4 w 


(7.106) 


D = 
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so a negative energy wave has wOP/Ow < 0. If the dissipative term P; is the same for both 
positive and negative waves, then for a given sign of w , the critical difference between 
positive and negative waves is due to the sign of 0P/Ow. Equation (7.105) shows that w; 
will have opposite signs for positive and negative energy waves. Hence, dissipation tends 
to drive negative energy waves unstable. Since there is usually some dissipation in any real 
system, a negative energy wave will generally spontaneously develop if it is an allowable 
mode and will grow at the expense of the free energy in the system (e.g., the free energy in 
the streaming particles). 


7.8 Assignments 


1. Consider the problem of short wave radio transmission. Let x be the horizontal di- 
rection and z be the vertical direction. A short wave radio antenna is designed in 
such a way that it radiates most of the transmitter power into a specified k, and kz 
at the antenna. This is determined essentially by the Fourier transform of the antenna 
geometry. 


(1) What is the frequency range of short wave radio communications? 


(ii) For ionospheric parameters, and the majority of the short wave band should the 
ionospheric plasma be considered magnetized or unmagnetized? 


(111) What is the appropriate dispersion for short wave radio waves (hint-it is very sim- 
ple) 

(iv) Using Snell’s law and geometric optics, sketch the trajectory of a radio wave 
showing what happens at the ionosphere. 


2. Using geometric optics discuss qualitatively with sketches how a low frequency wave 
could act as a lens for a high frequency wave. 


3. For the example of an electrostatic electron plasma wave [dispersion w? = beet + 


3k? r2.,)] show that the generalized group velocity as defined in Eq.(7.89) gives the 
same group velocity as found using the previous definition based on 0w/Ok. 


4. Calculate the wave energy density, wave energy flux, and group velocity for the elec- 
trostatic wave that can exist when a beam of cold electrons having velocity vo streams 
through a neutralizing background of infinitely massive ions. Discuss the signs of 
these three quantities. 


Vlasov theory of warm electrostatic waves in 
a magnetized plasma 


8.1 Uniform plasma 


It has been tacitly assumed until now that the wave phase experienced by a particle is 
just what would have been experienced if the particle had not deviated from its initial 
position xp. This means that the particle trajectory used when determining the wave phase 
experienced by the particle is x = Xo instead of the actual trajectory x = x(t). Thus the 
wave phase seen by the particle was approximated as 


k- x(t) — wt =k- Xo — wt. (8.1) 


This approximation is fine provided the deviation of the actual trajectory from the assumed 
trajectory satisfies the condition 


|k- (x(t) — xo) | << 1/2 (8.2) 


so any phase error due to the deviation is insignificant. Two situations exist where this 
assumption fails: 
1. the wave amplitude is so large that the particle displacement due to the wave is signif- 
icant compared to a wavelength, 


2. the wave amplitude is small, but the particle has a large initial velocity so that it moves 
substantially during one wave period. 

The first case results in chaotic particle motion as discussed in Sec.3.7.3 while the 
second case, the subject of this chapter, occurs when the particles have significant thermal 
motion. If the motion is parallel to the magnetic field, significant thermal motion means 
that vr is non-negligible compared to w/k, a regime already discussed in Sec.5.2 for 
unmagnetized plasmas. Thermal motion in the perpendicular direction becomes an issue 
when ki rr ~ 7/2, i.e., when the Larmor orbit rz, becomes comparable to the wavelength. 
In this situation, the particle samples different phases of the wave as the particle traces out 
its Larmor orbit. The subscript || is used here to denote the direction along the magnetic 
field. If the magnetic field is straight and given by B =Bz2, || would simply be the z 
direction, but in a more general situation the || component of a vector would be obtained 
by dotting the vector with B. 
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Consider an electrostatic wave with potential 
(x,t) = , exp(ik - x—iwt). (8.3) 


As before the convention will be used that a tilde refers to the amplitude of a perturbed 
quantity; if there is no tilde, then the exponential phase factor is understood to be included. 
Because the wave is electrostatic, Poisson’s equation is the relevant Maxwell’s equation 
relating particle motion to fields, i.e., 


1 
Kb, = = D Modo (8.4) 


where 7,1 is the density perturbation for each species ø. Since the density perturbation is 
just the zeroth moment of the perturbed distribution function, 


na = f fado, (8.5) 


the problem reduces to determining the perturbed distribution function f,1. 
In the presence of a uniform magnetic field the linearized Vlasov equation is 


Ofer yfo i do (v xB). fa = qo Vo, - ĉa 
ot Ox Mo Ov Mo Ov 
where the subscript 0 refers to equilibrium quantities and the subscript 1 to first-order per- 
turbations (no 0 has been used for the magnetic field, because the wave has been assumed 
to be electrostatic and so does not have any perturbed magnetic field, thus B is the equilib- 
rium magnetic field). 

Consider an arbitrary point x, v in phase space at time t. All particles at this point x, v 
at time t have identical phase-space trajectories in both the future and the past because the 
particles are subject to the same forces and have the same temporal initial condition. By 
integrating the equation of motion starting from this point in phase space, the phase-space 
trajectory x(t’), v(t’) can be determined. The boundary conditions on such a phase-space 
trajectory are simply 


(8.6) 


x(t) =x, v(t)=Vv. (8.7) 

Instead of treating x, vas independent variables denoting a point in phase space, let us 
think of these quantities as temporal boundary conditions for particles with phase-space 
trajectories x(t), v(t) that happen to be at location x, v at time t. Thus, the velocity distri- 
bution function for all particles that happen to be at phase-space location x, v at time t is 
foi = fo1(x(t), v(t), t) and since x and v were arbitrary, this expression is valid for all 
particles. The time derivative of this function is 

d fo Ofo1 dx Ofo1 dv 

ONO = “Sp + See G+ Se S 
In principle, one ought to take into account the wave force on the particles when calculating 
their trajectories. However, if the wave amplitude is small enough, the particle trajectory 
will not be significantly affected by the wave and so will be essentially the same as the 
unperturbed trajectory, namely the trajectory the particle would have had if there were no 
wave. Since the unperturbed particle trajectory equations are 


(8.8) 


dx dv qo 
Tap ap e ( 
dt dt Mo 


v x B) (8.9) 
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it is seen that Eq.(8.8) is identical to the left hand side of Eq.(8.6). Equation (8.6) can thus 
be rewritten as 


d qo 

(<p fora. ¥(9.0) = — V9, - 5 (8.10) 
unperturbed trajectory y 

where the left hand side is the derivative of the distribution function that would be mea- 

sured by an observer sitting on a particle having the unperturbed phase-space trajectory 

x(t), v(t). Equation (8.10) may be integrated to give 


t 
foi(x, v, t) = 2 fæ [ve oie (8.11) 


T N | x=x(t’),v=v(t') 


If the right hand side of Eq.(8.10) is considered as a ‘force’ acting to change the perturbed 
distribution function, then Eq.(8.11) is effectively a statement that the perturbed distribu- 
tion function at x, v for time ¢ is a result of the sum of all the ‘forces’ acting over times 
prior to t calculated along the unperturbed trajectory of the particle. “Unperturbed trajecto- 
ries” refers to the solution to Eqs.(8.9); these equations neglect any wave-induced changes 
to the particle trajectory and simply give the trajectory of a thermal particle. The ‘force’ 
in Eq.(8.11) must be evaluated along the past phase-space trajectory because that is where 
the particles at x, v were located at previous times and so that is where the particles ‘felt’ 
the ‘force’. This is called “integrating along the unperturbed orbits” and is only valid when 
the unperturbed orbits (trajectories) are a good approximation to the particles’ actual or- 
bits. Mathematically speaking, these unperturbed orbits are the characteristics of the left 
hand side of Eq.(8.6), a homogeneous hyperbolic partial differential equation. The solu- 
tions of this homogeneous equation are constant along the characteristics. The right hand 
side is the inhomogeneous or forcing term and acts to modify the homogeneous solution; 
the cumulative effect of this force is found by integrating along the characteristics of the 
homogeneous part. 

The problem is now formally solved; all that is required is an explicit evaluation of the 
integrals. The functional form of the equilibrium distribution function is determined by the 
specific physical problem under consideration. Often the plasma has a uniform Maxwellian 
distribution ni DRE" 

foo(v) = Ta te 


Ute = V2KT,/Me- (8.13) 
It must be understood that Eq.(8.12) represents one of an infinity of possible choices for 
the equilibrium distribution function — any other function of the constants of the motion 
would also be valid. In fact, functions differing from Eq.(8.12) will later be used to model 
drifting plasmas and plasmas with density gradients. 
Substitution of Eq.(8.12) into the orbit integral gives 


(8.12) 


where 


Node, 
foi(X,v,t) = 3/2, se exp | v? [vig | 
oVTo (8.14) 


7 J s dt! (Eyn gaesti iarl} 


To 
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Since the kinetic energy mv? /2 of the unperturbed orbits is a constant of the motion, the 
quantity exp [-v? / | has been factored out of the time integral. This orbit integral may 
be simplified by noting 


< exp [ik - x(¢’)] = ik - v(t’) exp [ik - x(t’)] (8.15) 


so that Eq.(8.14) becomes 


t 
finv) = -Eo fa {exp [iot] £ exp fk xe} 


-E fao {lexp (ik - x(t") — iwt!)]° | + iwTphase(x,t)} 
“= (8.16) 
where the phase-history integral is defined as 
t 
phase Kt) = / dt' exp (ik - x(t’) — iwt’). 
Evaluation of phase requires knowledge of the unperturbed orbit trajectory x(t’). This 


trajectory, determined by solving Eqs.(8.9) with boundary conditions specified by Eq.(8.7), 
has the velocity time-dependence 


v(t') = vB + vı cos [wes (t! — t)] — Ê x vı sin [woo (t' — t)]. (8.17) 


Equation (8.17) satisfies both the dynamics and the boundary condition v(t) = v and so 
gives the correct helical ‘unwinding’ into the past for a particle at its present position in 
phase space. The position trajectory, found by integrating Eq.(8.17), is 


x(t) = x+v) (t’-1)B 
+- {vu sin [Wee (t! = t)] Se Ê x VL (cos [Wea (t! = t)] = 1)} 


Weg 


(8.18) 


which satisfies the related boundary condition x(t) = x. 

To proceed further, we define ọ to be the velocity-space angle between the fixed quan- 
tity kų and the dummy variable v, so thatk, -vı =k, v, cosy and k, -Bxv, = 
kv] sing. Using this definition, the time history of the spatial part of the phase can be 
written as 
Kivy 


The phase integral can now be expanded 


k- x(t’) =k- xk) vy (t — t) + 


{sin [w.,(t' — t) + y] — sin y} . (8.19) 


t 
Ipnase(Xt) = eo I gr'a xe) iot) 
—co 


(kjuj —w) T+ 


0 
aia | drexp<i}] Kyu, 
—co 


t 
co 


{sin [wcoT + y] — sin y} 
(8.20) 
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where 7 = t' — t. Certain Bessel function relations are now of use. The first of these is the 
integral representation of the J, Bessel function, namely 


1 27 . . f 
Jn(Z) T ered in q0, (8.21) 


The inverse of this relation is 


ez sin 0 2 5 Jz (z)eir? (8.22) 
which may be validated by taking the 0 Fourier transform of both sides over the interval 
from 0 to 27. 

The phase integral can be evaluated using Eq.(8.22) in Eq.(8.20) to obtain 


0 
ed ik, v sin 
Iphase(Xt) = cikx—iwt y Jy (e+) en ikivising n if drei( Ai 2) T+in(weot +9) 
; ; 0 

ikpuysing © fein eee 2) 
ik-x—iwt .— kiv Z2 
= e e Weco Jn ( LYI ) z 

2. mee —i (w = kjv] — Nwe) 
(8.23) 
In Eqs. (8.16) and (8.23) there is a lower limit at t = —oo; this limit corresponds to the 


phase in the distant past and is essentially the information regarding the initial condition of 
the system in the distant past. We saw in our previous discussion of the Landau problem 
that initial value problems are properly discussed using a Laplace transform approach and 
that Fourier transforms do not work properly when there is an initial value problem. How- 
ever, if we ignore the initial conditions and use the Plemelj formula with Fourier transforms 
the same result as the Laplace method is obtained. This short-cut procedure will now be fol- 
lowed and so any reference to the initial conditions will be dropped and Fourier transforms 
will be used with invocation of the Plemelj formula whenever it is necessary to resolve any 
singularities in the integrations. Hence, terms evaluated at t = —oo in Eqs. (8.16) and 
(8.23) are dropped since these are initial values. After making these simplifications, the 
perturbed distribution function becomes 


kjv 


_ Goris = ®t _ikivisineg n( Os weir? 
foi(x, V,t) = = 1 e Weo 
KT m (w= kjv] = nwo) 


(8.24) 
The next step is to evaluate the density perturbation Eq.(8.5), an operation which involves 
integrating the perturbed distribution function over velocity; the velocity integrals are eval- 
uated using the Bessel identity (Watson 1922) 


g 1 2 
f zJ2(bz)e %7 dz = eae A: (8.25) 
0 
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Using this identity, substitution of Eq.(8.24) into Eq.(8.5) gives 


2m 
i 1— aE, 2 af ay f vi du, 


KT, na ju a A (5> Jew (v P+? vho 


(w — kjv — nwo) 


vatu? 
2( RLU = 
‘ee 


qo Nc0 Ves a Weg 
1 dv vdv 
kTo af TOS: me l = Rí (w — kjv = NW ea ) 


nale of 
KT, ue 


2. 2 2 In (ki rd.) oo eE? 
1 + aose ki p> g1/2 (E TR 


~ aa P10 1+ Adc -kirī Lo Soil ka 72.) Zone) 
(8.26) 
where 
Qno = ee ee (8.27) 
kyUre 


the Larmor radius is 


ice Io Me, (8.28) 


Weco 
and Z is the plasma dispersion function. Finally, Eq.(8.26) is substituted into Eq.(8.4) to 
obtain the warm magnetized plasma electrostatic dispersion relation 


D(w,k) =14 v= 1 + apse Fite NO In (Kir7,) Z(Onc)| =0. (8.29) 
o Do 


Note that D(w,k) refers to the dispersion relation and should not be confused with D 
the off-diagonal term of the cold-plasma dielectric tensor, nor with D the displacement 
vector. A similar, but more involved calculation using unperturbed orbit phase integrals 
for the perturbed current density gives the hot plasma version of the full electromagnetic 
dispersion, i.e., the finite temperature generalization of the cold plasma dielectric tensor 
K. Although the calculation is essentially similar, it is considerably more tedious, and 
the interested reader is referred to specialized texts on plasma waves such as those by Stix 
(1992) or Swanson (2003). 


8.2 Analysis of the warm plasma electrostatic dispersion 
relation 


Equation (8.29) generalizes the unmagnetized warm plasma electrostatic dispersion rela- 
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tion Eq.(5.51) derived in Chapter 4 to the situation of a magnetized warm plasma. This 
means that Eq.(8.29) should revert to Eq.(5.51) in the limit that B — 0. The Bessel iden- 
tity 
yy ase (8.30) 
together with the condition a,, — ag if B — 0 show that this is indeed the case. 
Furthermore, Eq.(8.29) should also be the warm-plasma generalization of the cold, 
magnetized plasma, electrostatic dispersion 


k2S + k?P = 0; (8.31) 


demonstration of this correspondence will be presented later. Equation (8.30) can be used 
to recast Eq.(8.29) as 


co 
kris 


D(w,k) =1+ rove TA XO L (kirle) [L+ aooZ(ano)] = 0. (8.32) 


n=—Cco 


Using I_»(z) = In (z), the summation over n can be rearranged to give 


—k? Io (kîr?e) (1 F aoo Z (a00)) 


tis 


PO) cas End Kir?) [2 + 002 {Z(ane) + E 


(8.33) 
In order to obtain the lowest order thermal correction, it is assumed that ay = w/ kyvre >> 
1 in which case the large argument expansion Eq.(5.73) can be used to evaluate Z (œo). In- 
voking this expansion and keeping only lowest order terms, shows that 


1 1 . 
1+aoZ(ao) = 14 af a h } za } a } int? exp(—08) | 
0 
1 P 
= - 5 t+ iaon"? exp(—a@). (8.34) 
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It is additionally assumed that |a,| = |(w — nwee)/kyvTo| >> 1 for n # 0; this cor- 
responds to assuming that the wave frequency is not too close to a cyclotron resonance. 
With this assumption the large argument expansion of the plasma dispersion function is 
also appropriate for the n Æ Oterms, and so one can write 


2+a90[Z(an)+Z(a_n)] = 24 a0 | : : Lilt (oct terea) 


An An 


w w 
= 2 H ido. T (e= + en) 
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where the subscript ø has been omitted from the a,, to keep the algebra uncluttered. Sub- 
stitution of these expansions back into the dispersion relation gives 


Te se. 
Io (Ki rT) (-zz F iao VF exp( 08.) ) 


2.2 2.32. 
Ekite 2n wis 


ey) 2,2 
a kA co Wi — N Wio 


n=1 ; eTa 
+Hiadg yT n2 
+e ">" 


= 0. (8.36) 


D(w, k) 


Equation (8.36) generalizes the magnetized cold plasma electrostatic dispersion to finite 
temperature and shows how Landau damping appears both at the wave frequency w and 
also at cyclotron harmonics, i.e., in the vicinity of nw,,. Two important limits of Eq.(8.36) 
are discussed in the following sections. 


8.3 Bernstein waves 


Suppose the wave phase is uniform in the direction along the magnetic field. Such a situa- 
tion would occur if the antenna exciting the wave is an infinitely long wire aligned parallel 
to a magnetic field line (in reality, the antenna would have to be sufficiently long to be- 
have as if infinite). This situation corresponds to having kj — 0 in which case the Landau 
damping terms and the 1/202, term vanish. The dispersion relation Eq.(8.36) consequently 
reduces to 


ere SS 2n?w? 
P= PO— Tn (Ac 8.37 
~ Ag Prepare (Ac) ( ) 
where 

`o = kî ro (8.38) 

and : 

2 
Dial Wee ie pon KT, 

Tig = oe = fe (8.39) 


is the Larmor radius. The waves resulting from this dispersion were first derived by Bern- 
stein (1958) and are called Bernstein waves or cyclotron harmonic waves; these waves 
are often called hot plasma waves because their existence depends on the plasma having 
a finite temperature. Both electron and ion Bernstein waves exist; early measurements of 
electron Bernstein waves in laboratory experiments were reported by Crawford (1965) and 
by Leuterer (1969) and of ion Bernstein waves were reported by Schmitt (1973). These 
waves have also been routinely observed by spacecraft and fitting of the measurements to 
the wave dispersion has been used, for example by Moncuquet, Meyervernet and Hoang 
(1995), to infer the electron temperature in the magnetized plasma of Jupiter’s moon Io. 
Electron Bernstein waves involve w being in the vicinity of an electron cyclotron harmonic, 
i.e., W? ~ O(n?w?.), while for ion waves w is in the vicinity of an ion cyclotron harmonic. 
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Figure 8.1: Electron Bernstein wave dispersion relations 


The Bernstein wave dispersion relation, Eq. (8.37), provides a transcendental relation 
between w and k; and for any given k1 has an infinite number of roots w . This is because 
as w is varied, the factor (2? — nu?) ~" in each term of the summation takes on all values 
from —ooto oo so that there are an infinite number of ways for the right hand side to equal 
the left hand side. In particular, it should be noted that the right hand side becomes large 
and positive whenever w is just slightly larger than nw,, for any n. 

We now consider electron Bernstein waves and so drop the ion terms from the summa- 
tion (the analysis for ion Bernstein waves is similar). The subscript o will now be dropped 
and it will be understood that all quantities refer to electrons. On keeping the electron 
terms only, Eq.(8.37) reduces to 


eo oo n2w2 
E2 P Ia (X 8.40 
n err (Aà) (8.40) 


which has the following limiting forms when we << w? and vice versa. 
l. w2 << w? case: 


Here the term 2n? w? /(w? — n?w?) is negligible compared to unity except when w? ~ 


O(n?w2,). Thus, for a given w only one term in the summation is near resonance and 
the dispersion relation is satisfied by this one term on the right hand side of Eq.(8.40) 
balancing the left hand side. This results in a spectrum waves that are slightly above 
the cyclotron harmonics, namely 


= 
W? = nw? + 2 nw In (A), forn =1,2,...,00. (8.41) 
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The small and large A limits of this expression are determined using the asymptotic 
values of the I,, Bessel function, namely for n > 1, 


lim (à) = 2 à 
A< <1 n! \2 (8.42) 


to give 


n-1 
w? = nu? + (1— A) nuE (3) for A << 1, (8.43) 
w? = n?w? for A >> 1. 

The n = 1 mode differs slightly from the n > 1 modes because the n = 1 mode has 
the small A dispersion 

w? = wii, — Aw (8.44) 
showing that this mode is the warm plasma generalization of the upper hybrid reso- 
nance. The modes for which n > 1 start at w? = n?w? when A = 0, have a maximum 
frequency at some finite A, and then revert to w? = n?w? as A — oo. To the left 
of the frequency maximum the group velocity 0w/Ok is positive and to the right of 
the maximum the group velocity is negative. This system of modes is sketched in 


Fig.8.1(a). 
2: we >> w? 

For large À, the product of the exponential factor and the modified Bessel function 

in Eq.(8.40) is unity, leaving one more factor of A in the denominator, so again the 

dispersion reduces to cyclotron harmonics, w? = n?w? as A — oo. For small à the 
situation is more involved because to lowest order, the n = 1 term is independent of 
A and so must always be retained when approximating Eq.(8.40). Let us suppose that 
some other term, say the (n + 1)” term is near resonance, i.e., wW? ~ (n + 1)?w?. 
Then, keeping the left hand side, the n = 1 term and the (n + 1)” term, expansion of 
Eq.(8.40) results in 


2 sai n 
EER pate (5) (8.45) 
[n +1) — 1 w2 j? — (n+ 1)? w2] n! \2 
which may be solved for w to give 
w=(n+1)w.41 (8.46) 


(w2 = [n +1}? -— 1| w2) mn) 


Thus, for small A the nt” mode starts at w ~ (n + 1)we and then as \ —> œo the 
frequency decreases towards the asymptotic limit nwe. This dispersion is sketched in 
Fig.8.1(b). 

Excitation of Bernstein waves requires ky = 0 which is a quite stringent requirement. 
The antenna must be absolutely uniform in the direction along the magnetic field because 
otherwise a finite kj will be excited which will result in the waves being subject to strong 
Landau damping. 
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8.4 Warm, magnetized, electrostatic dispersion with small, 
but finite ky 


An alternate limit for Eq.(8.36) would be to allow k to be finite but also have k? r} << 1 
so that only the lowest-order finite Larmor radius terms are retained. To keep matters simple 
and also to relate to cold plasma theory, kj will be assumed to be sufficiently small that 
w/ky >> vr, vri. Since rẹ ,/ = w>,/W2,, the lowest-order perpendicular thermal 
terms will be O(k*+ ) and so perpendicular quantities up to fourth order must be retained. 
This means that both the n = 1 and the n = 2 terms must be retained in the summation 


over n. With these approximations and using 1/ Dae = we /w*, Eq.(8.36) becomes 


1 f kiri Wee ý oovt e720 
© 4 w2 | AZ 


2 
_ 2We5 
252, joe Tae 
ki Woe w We 
2 2 ent 
Di, w 
ki + ki + X (1— kf r?) eo | +iaoyT a = 0 
e = 
7 2 
_ 8We5 
A 2, 2 V2 =) Apge 
KITE Who w A 
Bw? es 
c9: Hay T 
oyT teda 


(8.47) 


The Landau damping terms will be assumed to be negligible to keep matters simple and the 
equation will now be grouped according to powers of k? . Retaining only the lowest-order 
finite-temperature perpendicular terms gives 


w2 w? 3wt A? 
po L pe po po“Do = 
i ( Le arg ) E ere ae) ) = 


o o 


(8.48) 
This is of the form 

—kt en +k? S+ kj P =0 (8.49) 

where the perpendicular fourth-order thermal coefficient is 

ws A? 
po ^Do 

= ; 8.50 
Eth >, (a = ws) (w? — =) en 


Equation (8.49) is a quadratic equation in k? . The cold-plasma model used earlier in effect 
set €+ = Oso that a wave propagating through an inhomogeneous plasma towards a hybrid 
resonance where § — 0 would have k? — oo. This non-physical prediction is resolved 
by the warm plasma theory because etn, while small, is finite and so prevents k? — oo 
from occurring. What happens instead is that Eq. (8.49) has two qualitatively distinct roots, 
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namely 
é St, /S2 + denki P 
ki = ; .51 
i T (8.51) 
2 hot wave 
ky 


going out 


pon 


a a mode 


cold wave coalescence 
X 


S=0 


Figure 8.2: Linear mode conversion of cold mode into hot mode near hybrid resonance 


Far from the hybrid resonance S' is large so that the two modes are well separated and 
given by 


Pog 
a = -zkio cold plasma wave 
> oi (8.52) 
ki =—, hot plasma wave 
Eth 


but as S — 0 these two modes coalesce. The hot plasma wave results from balancing the 
first and middle terms in Eq. (8.49) and the cold plasma wave results from balancing the 
middle and last terms. The actual mode coalescence occurs where the square root term in 
Eq.(8.51) vanishes, i.e., where 


S? = — enki P. (8.53) 


The WKB approximation fails at the mode coalescence, because here dk; /da = oo and 
so a more sophisticated analysis is required. This analysis, which will be discussed in the 
next section, shows that the hot and cold waves are strongly coupled at the coalescence 
region. In particular, it will be demonstrated that a cold wave propagating towards the 
hybrid resonance, as shown in Fig.8.2, will be linearly converted into a hot wave which 
then propagates back out from the resonance. 
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8.5 Analysis of linear mode conversion 


8.5.1 Airy equation 


The procedure for analyzing linear mode conversion was developed by Stix (1965) and is 
an extension of the standard method for connecting solutions of the Airy equation 


y” +2y =0 (8.54) 


from the x < O region to the x > O region. The Airy problem involves a second order 
ordinary differential equation with a coefficient which goes through zero. The mode con- 
version problem involves a fourth order equation which also has a coefficient going through 
zero. The Airy connection method will first be examined to introduce the relevant concepts 
and then these will be applied to the actual mode conversion problem. 

The sequence of steps for developing the solution to the Airy problem are as follows: 

1. Laplace transform the equation: Equation (8.54) cannot be Fourier transformed be- 
cause of the non-constant coefficient x in the second term; however it can be Laplace 
transformed by make use of the relation that the Laplace transform of xy(x) is —dy(p) /dp. 
Thus, the Laplace transform of Eq.(8.54) is 


Pup) === ~ =0: (8.55) 


Strictly speaking, Laplace transform of y” will also bring in terms involving initial 
conditions, but these will be ignored for now and this issue will be addressed later. 
Equation (8.55) is easily solved to give 


9(p) = Aexp(p?/3) (8.56) 


where A is aconstant. The formal solution to Eq.(8.54) is the inverse transform 


ya) = (2ni) f 


G(p)eP*dp = a / ef Mdp (8.57) 
C 


C 


where a = A/2ri, f(p) = p?/3 + px, and C is some contour in the complex plane. 
By appropriate choice of this contour Eq.(8.57) is shown to be a valid solution to the 
original differential equation. In particular, if the endpoints pı and p2 of the contour 
C are chosen to satisfy 

[e o =0 (8.58) 

Pı 

then Eq.(8.57) is a valid solution to Eq.(8.54). The reason for this may be seen by 
explicitly calculating y” and zy using Eq.(8.57) to obtain 


d? 
~ dr? 


i 


y a | eP /3+P@ dn = a f pre? /3+P2dp (8.59) 
C C 
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and 4 
= eP'/3 ( — ert ) d 
xy a fo (= ) a 


d 
= a [er?/+P| is ert a fo I (e°) e?* dp (8.60) 
1 


= -a fo po'/*Pdp, 
Addition of these results corresponds to Eq.(8.54). 


2. Suppose the integrand is analytic in some region and consider two different contours 
in this region having the same pair of endpoints. We can take one of these contours and 
deform it into the other. Thus, the two contours give the same result and so correspond 
to the same solution. 


3. Independence of contours: Since Eq. (8.54) is a second order ordinary differential 
equation it must have two linearly independent solutions. This means there must be 
two linearly independent contours since a choice of contour corresponds to a solution. 
Linear independence of the contours means that one cannot be deformed into the other. 
One possible way for the two contours to be linearly independent is for them to have 
different pairs of end points. In this case, one contour can only be deformed into 
the other if the end points can be moved without changing the integral. If moving 
the end points changes the integral, then the two contours are independent. Another 
possibility is a situation where the two contours have the same pairs of endpoints, but 
the integrand is not analytic in the region between the two contours. For example 
there could be a pole or branch cut between the contours. Then, one contour could 
not be deformed into the other because of the non-analytic region separating the two 
contours. The two contours would then be linearly independent. 


8.5.2 Steepest descent contour 


The solution to the Airy equation is thus 


y(x) a ef P2)dp (8.61) 
g 


where C satisfies the conditions listed above and the solution can be multiplied by an 
arbitrary constant. In order to evaluate the integral it is useful to first separate the complex 
function f (p, x) into its real and imaginary parts, 


Also, it should be remembered that for the purposes of the p integration, x can be consid- 
ered as a fixed parameter. Thus, when performing the p integration, x need not be written 
explicitly and so we may simply write f (p) = fr(p) +if;(p) where p = u + iv and u,v 
are the coordinates in the complex p plane. For an arbitrary contour in the p plane, both f, 
and f; will vary. The variation of phase means that there will be alternating positive and 
negative contributions to the integral in Eq.(8.61), making evaluation of this integral very 
delicate. However, evaluation of the integral can be made almost trivial if the contour is 
deformed to follow a certain optimum path. 
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To see this, it should first be recalled that the theory of complex variables shows that 
any function of a complex variable must satisfy the Cauchy-Riemann conditions, which 
means that f, and f; satisfy 


Of, _ Ofi Əfr ofi 

ðu Ov? Ov OO" ega) 
Also, on defining the gradient operator in the u,v plane V = ti0/Ou + 60/dv it is seen 
that 
Of. a Of. i Of, r Of. i 
ðu Ou Ov Ov 
This means that contours of constant f, are everywhere orthogonal to contours of constant 
fi- 

Since f; corresponds to the phase of exp(f,(p) + if;(p)) and f, corresponds to the 
amplitude, Eq.(8.64) shows that a path in the complex p plane which is arranged to follow 
the gradient of f, will automatically be a contour of constant f;, i.e., a contour of constant 
phase. Thus, the optimum path is to follow V f, because on this path f; will be constant 
and so there will not be any alternating positive and negative contributions as on any other 
path. In fact, because it is constant, the phase can be factored from the integral when using 
this optimum path, giving 


Vir c Vii = 


=0. (8.64) 


y(r) = efi / ef Pap. (8.65) 
CIV f- 


Clearly, the maximum contribution to this integral comes from the vicinity of where 
fr assumes its maximum value. Since the maximum occurs where V fy = 0, most of the 
contribution to the integral comes from the vicinity of where V f, = 0. The extrema of 
f are always saddle points because the Cauchy-Reimann conditions imply 07 fri / ðu? + 
a? Fosa. Ov? = 0. Thus, the vicinity of V f, = 0 must be a saddle point. 

The discussion in the previous paragraph implies that once the endpoints of the contour 
have been chosen, for purposes of evaluation it is advantageous to deform the contour so 
that it follows the gradient of f»; this called the steepest ascent/descent path (usually just 
called steepest descent). This optimum choice of path ensures that (1) there is a localized 
region where f, assumes a maximum value and (ii) the phase does not vary along the path. 
The contribution to the integral is concentrated into a small region of the complex p—plane 
in the vicinity of where f, has its maximum value. Simple integration techniques may be 
used to evaluate the integral in this vicinity, and the contribution from this region dominates 
other contributions because exp( f, (p)) at the maximum of fy is exponentially larger than 
at other places. Furthermore if the contour is chosen so that f, —> —oo at the endpoints 
then Eq.(8.58) will be satisfied and the chosen contour will be a solution of the original 
differential equation. 

The saddle points are located where f'(p) = 0. For the Airy function, 


f(p) = p®/3+pa 
p +2 
fp = 2p (8.66) 


Sh 
Ə 
I 
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so the saddle points (denoted by subscript s) are located at ps = +(—x)!/?. In the vicinity 
of the saddle point, f can be Taylor expanded as 
2 
P—Ds 
fle) = fps) + PPX prp,), (867) 


It is now convenient to define the origin of the coordinate system to be at the saddle point 
and also use cylindrical coordinates, r, 0 so that p — ps = r exp(i@) and for fixed 0, dp = 
dre'®. Also, phasor notation is used for f” so f” = |f"| exp(iv) where w is the phase of 
f". Thus, in the vicinity of the saddle point we write 
i 

1) = Fpa) + EES exp (aio + iv). (8.68) 
Choosing the contour to follow the path of steepest descent corresponds to choosing 0 such 
that 20 + w = +r in which case 


i 
F(p) = F(ps) - lt (Pell (8.69) 
and so Eq.(8.61) becomes 
y(x) ~ I ef (ps)— aT IF" (Ps)| drei? (8.70) 


where the approximation has been made that nearly all the contribution to the integral 
comes from small r; it is not necessary to worry about errors in the Taylor expansion 
at large r, since the contributions from large r are negligible because of the exponential 
behavior. Thus f, is maximum at the saddle point r = O and the main contribution to 
the integral comes from the contour going over the ridge of the saddle. The r integral is a 
Gaussian integral f exp(—ar?)dr = \/7/a so 


y(t) = i. ef 0)?" Pdre? 


Co 


= aa! eof" (Padl/2gp 


— ept |__2% 
|f" (ps)| 
i20 
= efte.) ,| 27°? 
|f" (ps)| 
= ef (Ps) men 
|f" (ps)| 
: 2T 
= ef(s) Fp (8.71) 


Since f(ps) = p (p?/3 + £) = +V—a (22/3) = F2(—x)?/2/3 and f’ (ps) = +2/—z it 


is seen that 
ef) dp A EN aye. (8.72) 
vicinity (—a)1/2 
of saddle 
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Providing x is not too close to zero, the two saddle points determined by the + signs are 
well-separated and do not perturb each other; the integral will then be a summation over 
whichever saddle points the contour happens to pass over. The polarity of a saddle point is 
determined by the sense in which the contour passes through the saddle point. 


8.5.3 Relationship between saddle point solutions and WKB modes 


For large |x| Eq. (8.54) can be solved approximately using the WKB method, in which case 
it is assumed that y(x) = A(z) exp(i f” k(x')dz’). The wavevector k(x) is determined 
from the dispersion relation associated with the original differential equation and A(z) is a 
function of k(x). At large |x| the dispersion relation associated with Eq.(8.54) obtained by 
assuming d/dx — ik is 

P= «2 (8.73) 
so k(x) = +x"? and A(x) ~ (+a)~1/*. The WKB solution is thus 


l +i f? 2/2 da! 1 +i2z?/?/3 
YWKB ~ ETN J = aas (8.74) 


which is the same as the saddle point solution. Thus each saddle point solution corresponds 
to a WKB mode, and in particular to a propagating wave if x > 0 and to an evanescent mode 
ifx <0. 


8.5.4 Using boundary conditions to choose contours 


The choice of contour is determined by the boundary conditions which in turn are deter- 
mined by the physical considerations. In this Airy equation problem, the boundary condi- 
tion would typically be the physical constraint that no exponentially growing solutions are 
allowed for x < 0. This forces choice of a contour which does not pass through the saddle 
point having the plus sign in Eq.(8.74) when x < 0. This condition together with Eq.(8.58) 
uniquely determines the contour. Choosing a contour C is equivalent to specifying a par- 
ticular solution to Eq.(8.57); this chosen solution can be evaluated for x > 0. For x > 0 
the saddle points become reshuffled and different and so for purposes of evaluation along 
the steepest descent path, the chosen contour must be deformed to pass through the saddle 
points as they exist for x > 0. The sum of the contributions of these saddle points gives the 
form of the solution for x > 0. The important point here is that the same contour is used for 
both x > 0 and x < 0 because the linearly independent solution is determined by the con- 
tour. The contour is determined by its endpoints, but may be deformed provided analyticity 
is preserved. Thus, the endpoints are the same for both the x > 0 and x < 0 evaluations, 
but the deformations of the contour differ in order to pass through the respective x > 0 or 
x < 0 saddle points. The sign of x determines the character of the saddle points and so for 
purposes of evaluation the contour is deformed differently for x > 0 and x < 0. 

This completes the discussion of the Airy problem and we now return to the linear 
mode conversion problem (Stix 1965). A comparison of Eqs.(6.108) and (8.49) shows that 
if Eq.(8.49) is “un-Fourier” analyzed in the x-direction, it becomes 


d d? d J 
aE (cna s$) p— k? Po=0 (8.75) 
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where it is assumed that €+, is approximately constant. To prove this is the correct form, 
consider the following two statements: (i) if €4, — 0, Eq.(8.75) reverts to Eq.(6.108) 
(Fourier-analyzed in the z direction, but not the x direction), (ii) if €,, is finite and S and 
P are assumed uniform, then Fourier analysis in x restores Eq.(8.49). 

We now suppose that the plasma is non-uniform such that S(x) = 0 at some particular 
value of x.The x origin is defined to be at this location and so corresponds to the location 
of the hybrid resonance. In order to be specific, we assume that we are dealing with a mode 
where S > Oand P < 0, since S and P must have opposite signs for the cold plasma wave 
to propagate. Then, in the vicinity of x = O Taylor expansion of S gives S = S'x so that 
Eq.(8.75) becomes 


ae d q? 
Eth 2 HS” £ t S'x s ki Po =0 (8.76) 
which has a coefficient which vanishes at x = O just like the Airy equation. Although 
Eq.(8.76) could be analyzed as it stands, it is better to clean it up somewhat by changing to 


a suitably chosen dimensionless coordinate. This is done by first defining 


x 
(= (8.77) 


where A is a yet-undetermined characteristic length which will be chosen to provide maxi- 
mum simplification of the coefficients. Replacing x by € in Eq.(8.76) gives 


en dtd  Sidb , S! a 


Matt Nae TO oe ki Po =0. (8.78) 
This becomes do dd Lo 
a ae £ ae ud =0 (8.79) 
if A is chosen to have €;),/S'A? = 1 and we set u = =k AP/S'. These choices imply 
2p_1/3 
`= e , p= es . (8.80) 


Equation (8.79) describes a boundary layer problem having complicated behavior in the 

inner || < 1 region in the vicinity of the hybrid resonance and, presumably, simple WKB- 

like behavior in the outer |£| >> 1 region away from the hybrid resonance. To solve this 

problem the technique discussed for the Airy equation will now be applied and generalized: 

1. Laplace Transform: While Eq.(8.79) cannot be Fourier analyzed in the € direction, it 
can be Laplace transformed giving 


Re 2 d A . 
palo) + pale) — =, [PIR] + Hdl) = 0 (881) 
or equivalently 
d [25 2,1 Hv 
ae = Se 8.82 
P(p)| dp p ()| P ttp wee) 


This has the solution 


(p) = Aexp(p*/3 — n/p — np) (8.83) 


8.5 Analysis of linear mode conversion 247 


where A is a constant. The inverse transform is 


pE) =A f ef Pdp (8.84) 
C 
where 
3 
f(y) =F — Fn) +96. (8.85) 


. Boundary conditions: Since Eq.(8.79) is a fourth-order ordinary differential equation, 
four independent solutions must exist with four associated independent choices for the 
contour C in Eq.(8.84). The appropriate contour is determined by the imposed physi- 
cal boundary conditions which must be equal in number to the order of the equation. 
We consider a specific physical problem where an external antenna located at £ >> 0 
generates a cold plasma wave propagating to the left. This physical prescription im- 
poses the following four boundary conditions: 


(a) To the right of € = O there is a cold plasma wave with energy propagating 
to the left (i.e., towards the hybrid layer). It is important to take into account 
the fact that the group velocity is orthogonal to the phase velocity for these 
waves in which case the wave is backwards in either the x or the z direction. 
The dispersion of these waves is given in Eq.(6.102) and the group velocity in 
Eq.(6.104). A plot of Sw) and P(w) shows that 3S /ôw > 0 and OP/dw > 0. 
Thus, for the situation where S > 0 and P < 0 the x component of the wave 
phase and group velocities have opposite sign. Hence, the cold wave phase 
velocity must propagate to the right to be consistent with the boundary condition 
that cold wave energy is propagating to the left. 


(b) To the right of € = O there is no warm plasma wave propagating to the left (for 
the hot wave, energy and phase propagate in the same direction). 


(c) To the left of £ = 0 the cold plasma evanescent mode vanishes as £ — —oo. 


(d) To the left of € = 0 the hot plasma evanescent mode vanishes as £ — —oo. 


The question is: what happens at the hybrid layer? Possibilities include absorption of 
the incoming cold plasma wave (unlikely since there is no dissipation in this problem), 
reflection of the incoming cold plasma wave at the hybrid layer, or mode conversion. 


. Calculation of saddle points: The saddle points are the roots of 


fp) =r ret D= Si. (8.86) 
p P 

For large £, these roots separate into two large roots (hot mode) given when the first 

two terms in Eq.(8.86) are balanced with each other and two small roots (cold mode) 

given by balancing the second and third terms. The large roots satisfy p? = —€ while 

the small roots satisfy p? = —ju/€. For large £, the fourth term is small compared to 

the dominant terms for both large and small roots. The quantity f can be approximated 


248 Chapter 8. Vlasov theory of warm electrostatic waves in a magnetized plasma 


for large p as 


2 
= ($ e) Inp 
2 
= gps — inp (8.87) 


while for small p it can be approximated as 


fe) > momi 
-o(a 
-olw 


= 2p£— np. (8.88) 


For € > 0 and large roots, the quantities p,, f (pı) and f” (pı) are respectively 


pi =+)",  f(m) = + (be —Inp,, f"(pi) = +2(-8)'”?. (8.89) 


For € > 0 the corresponding quantities for the small roots are 


3/2 
ps =+(—n/£)"?, F(ps) = +2(—-Eu)'/? — Inps, ft.) = 72S . (8.90) 


The Gaussian integrals corresponding to steepest descent paths over these € > O sad- 
dle points are 


large roots: | ef dp = ame (8.91) 


of saddle 


small roots : ef dp = SF +2 guy? (8.92) 
mere (=n)? 


where the logarithmic term in f (p) has been taken into account. For € < 0, the large 
root quantities are 


2 
pi =E, f(a) = a —Inp, f"(p) = +261? (8.93) 
and the small root quantities are 


3/2 
pe= (MEDY, Fo) = F2( EW"? mps fo) =F É 8% 
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4. Choice of contour path: For € < O and assuming u > O the saddle points giving 
solutions that vanish when £ — —oo are the saddle points with the upper (i.e., minus) 
sign chosen in the argument of the exponential. Hence, for € < 0 a contour must be 
chosen that passes through one or both of these saddle points. For € > O the upper sign 
corresponds to waves propagating to the left (1.e., towards the hybrid layer). Hence, 
we allow the upper sign (minus) for the small root (i.e., cold mode), but not for the 
large root since one of the boundary conditions was that there is no inward propagating 
hot plasma wave. To proceed further it is necessary to look at the topography of the 
real part of f (p) for both signs of £; see assignments. 


From these plots it is seen that the correct joining is given by 


.exp(—3[E|°/?) exp[3 (i€)?/?] 
\e|3/4 j1/2¢3/4 
<> 
exp[=2u]§|/7] 212 pile)" ?] (8.95) 
(len) 74 (EN 
evanescent side, € < 0 propagating side, > 0 


which shows that a cold wave with energy propagating into the S = O layer is con- 
verted into a hot wave that propagates back out. 


5. WKB connection: The quantities in Eq.(8.95) can be expressed as integrals having the 
same form as WKB solutions. For example, the cold propagating term can be written 
as 


expl2i(ué)'/?] _ exp fo (u/s!) as" 
wae VIUEN aE 


peal fo (—kK2P/S)" azr] 


Jo K2P/S)"/2a2'] 
where the last form is clearly the WKB solution. A similar identification exists for 
the hot plasma mode, so that Eq. (8.95) can also be written in terms of the WKB 
solutions. 

This is just one example of mode conversion; other forms also occur in different con- 
texts but a similar analysis may be used and similar joining conditions are obtained. A 
curious feature of the mode conversion analysis is that the differential equation is never 
explicitly solved near € = 0; all that is done is match asymptotic solutions for the region 
where € is large and positive to the solutions for the region where € is large and negative. 


(8.96) 


8.6 Drift waves 


Only textbook plasmas are uniform — real plasmas have both finite extent and pressure 
gradients. As an example, consider an azimuthally symmetric cylindrical plasma immersed 
in a strong axial magnetic field as sketched in Fig.8.3. Particles stream freely in the axial 
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direction but are constrained to make Larmor orbits in the perpendicular direction. Since 
the concept of pressure gradient has no meaning for an individual particle, consideration of 
the effect of pressure gradients requires a fluid or Vlasov point of view. This will be done 
first using a two-fluid analysis, then using a Vlasov analysis. 

From the two-fluid point of view, the radial pressure gradient implies an equilibrium 
force balance 


0=9a X Beng V (noKT3). (8.97) 


Solving Eq.(8.97) for uz shows that each species has a steady-state perpendicular motion 
at the diamagnetic drift velocity 


Udo = (8.98) 


which is in the azimuthal direction. The corresponding diamagnetic drift current is 


Ja = S/N doUae 
= 35 VV (nese) x B 


1 
= -VP xB (8.99) 


which is just the azimuthal current associated with the MHD equilibrium equation J x B = 
VP where P = P; + P}. 

The electron and ion diamagnetic drift velocities thus provide the current needed to 
establish the magnetic force which balances the MHD pressure gradient. It turns out that 
magnetized plasmas with pressure gradients are unstable to a class of electrostatic modes 
called ‘drift waves. These modes exist in the same frequency regime as MHD but do not 
appear in standard MHD models, since standard MHD models do not provide enough detail 
on the difference between electron and ion dynamics. 

From the particle point of view, the diamagnetic velocity is entirely fictitious because 
no particle actually moves at such a velocity. The diamagnetic drift velocity is nevertheless 
quite genuine from the fluid point of view and so a fluid-based wave analysis must linearize 
about an equilibrium that includes this equilibrium drift. As will be seen later, the Vlasov 
point of view confirms and extends the conclusions of the fluid theory, providing care is 
taken to start with an equilibrium velocity distribution function which is both valid and 
consistent with existence of a pressure gradient. 
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Figure 8.3: Cylindrical magnetized plasma with radial density gradient 


Drift waves will be examined using three progressively more realistic points of view. 
These are: 
1. A collisionless two-fluid model where ions are assumed to be cold and electrons are 
assumed to be hot. This model establishes existence of the mode and provides a 
derivation for the intrinsic frequency, but provides no information regarding stability. 


2. A collisional two-fluid model which shows that collisions destabilize drift waves. 


3. A Vlasov model including both finite ion temperature and net axial current. This 
model shows that both Landau damping and axial currents destabilize drift waves. 

Drift waves involve physically distinctive ion and electron motions, three-dimensional 
geometry, magnetized warm plasma effects, pressure gradients, collisionality, and Landau 
damping/instability. Drift waves are often of practical importance because they are so eas- 
ily driven unstable. While real plasmas have geometry resembling Fig.8.3, the cylindrical 
geometry will now be replaced by Cartesian geometry to make the analysis more straight- 
forward. 

Using Cartesian geometry, the equilibrium magnetic field is assumed to be in the z 
direction and the new feature, an exponential density gradient in the x direction, represents 
the pressure gradient. The x direction thus corresponds to the r direction of cylindrical 
geometry, and the y direction corresponds to the 0 direction. The y and z directions, but not 
the x direction, are ignorable coordinates, and so the governing equations may be Fourier- 
transformed in the y and z directions, but not in the x direction. 
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8.6.1 Simple two-fluid model of drift waves 
The plasma is assumed to have the exponential density gradient 


n ~ exp(—2/L) (8.100) 


where L is called the density-gradient scale length. We consider electrostatic, extremely 
low frequency (w << wei) waves having the potential perturbation 


ob, = bexp(ikyy + ikzz — iwt). (8.101) 
The parallel phase velocity is assumed to lie in the range vr; << w/k, << vre so that 
ions are adiabatic and electrons are isothermal (this is the same regime as ion acoustic 
waves). The parallel and perpendicular wavelengths are both assumed to be much longer 
than the electron Debye length, so that the plasma may be considered quasi-neutral and so 
have n; ~ ne. Since w/k, << Vre, the parallel component of the electron equation of 
motion is simply 

0 S —q.-— — —— (neKTe) (8.102) 


which leads to a Boltzmann electron density, 


Ne = Neo EXP(—deb, /KTe). (8.103) 


Assuming ¢, is small, linearization of this Boltzmann electron density gives the first-order 
electron density 


Mer n qe? 


8.104 
Ned KT ( ) 


Since w/k, >> vri, ions may be considered cold to first approximation and the zero- 
pressure limit of the ion equation of motion characterizes ion dynamics. Furthermore, 
since w << wei the lowest-order perpendicular ion motion is just the E x B drift so 


—-V¢o,xB __ ikyd, . 
= T 
B? B 
Since the & x B driftis in the x direction, it is in the direction of the density gradient and 
so this drift leads to an ion density perturbation because of a convective interaction with 
the equilibrium density gradient. The ion density perturbation is found by linearizing the 
ion continuity equation 


(8.105) 


uü; = 


On; 
oa + ua- Vno +naV -ua = 0. (8.106) 
After noting that V - u;; = 0, substitution for u; in the convective term gives 
NA kydy 
= ‘ 8.107 
nio wLB ( ) 


The density perturbation results from the ion E x B drift causing the ions with their density 
gradient to slosh back and forth in the x direction so that a stationary observer at a fixed 
point x sees oscillations of the ion density. 

Quasi-neutrality means that the electron and ion density perturbations densities must be 
almost exactly equal although electrons and ions are governed by entirely different physics. 
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Equating the electron and ion perturbed densities, respectively given by Eqs.(8.104) and 
(8.107), provides the most basic form of the drift wave dispersion relation, 


kyKTe 
w= — LB = kyUde, (8.108) 
where 
Ude = —KTe /qe LB (8.109) 


is the electron diamagnetic drift velocity given by Eq.(8.97). Equation (8.108) describes a 
normal mode where Boltzmann electron density perturbations caused by isothermal elec- 
tron motion along field lines neutralize ion density perturbations caused by ion E x B 
motion sloshing the x-dependent equilibrium density profile in the x direction. This ba- 
sic dispersion relation provides no information about the mode stability because collisions 
and Landau damping have not yet been considered. The wave phase velocity w/k, is equal 
to the electron diamagnetic drift velocity given by Eq.(8.98). This dispersion relation has 
the interesting feature that it does not depend on the mass of either species; this is because 
neither the electron Boltzmann dependence nor the ion EF x B drift depend on mass. This 
basic dispersion is the building block for the more complicated models to be considered 
later, and it is conventional to define the ‘drift frequency’ 


w* = kytige (8.110) 


which will appear repeatedly in the more complicated models. The basic dispersion is 
therefore simply w = w* (the asterisk should not be confused with complex conjugate). 


8.6.2 Two-fluid drift wave model with collisions 


The next level of sophistication involves assuming that the plasma is mildly collisional so 
that the electron equation of motion is now 
du, 1 

Me aS qe (—V¢é+ ue x B) — NV (neKTe) — VeiMe(Ue — Uj). (8.111) 
The collision frequency is assumed to be small compared to the electron cyclotron fre- 
quency. Because w/k, << vre, the electron inertia term on the left hand side is negligible 
compared to the electron pressure gradient term on the right hand side and so the parallel 
component of the electron equation reduces to 


Od KT. One 
az Ne OZ 


the ion parallel velocity has been dropped since it is negligible compared to the electron 
parallel velocity. After linearization and assuming perturbed quantities have a space-time 
dependence given by Eq.(8.101), Eq.(8.112) can be solved to give the parallel electron 
velocity 


0= 


VeiMeuez; (8.112) 


ik, KT, n 
TERE f Me oe Met (8.113) 
. . . . . VeiMe . KTe Neg 
It is seen that in the limit of no collisions, this reverts to the Boltzmann result, Eq.(8.104). 
As in the previous section, quasi-neutrality is invoked so that the dispersion relation is 


obtained by equating the perturbed electron and ion densities. Equation (8.113) together 
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with the continuity equation gives the perturbed electron density, provided the electron 
perpendicular velocity is also known. Since it has been assumed that wce >> Vei, the 
magnetic force term in Eq.(8.111) is much greater than the perpendicular component of 
the collision term. The perpendicular component of the electron equation of motion is 
therefore 


1 
0= qe (V1 + ue xB)- V1 (neKTe). (8.114) 


In previous derivations the continuity equation was typically linearized right away, but 
here it turns out to be computationally advantageous to postpone linearization and instead 
solve Eq.(8.114) as it stands for the perpendicular electron flux to obtain 


neVOxB 1 
B2 qeB? 


Using the vector identities V - (Vo x B) = 0 and V - [V (n-kT-) x B] = 0 it is seen that 
the divergence of the perpendicular electron flux is 


Te, = Nee, = V (nekT.) x B. (8.115) 


Vo xB 
Mie Ville age (8.116) 
The electron continuity equation can be expressed as 
One ð 
— (Neuez) +V -Tel =0, .117 
OE + az (Netlez) + ale (8.117) 
and, after substitution for the perpendicular electron flux, this becomes 
One o VoxB 
aL + 5, (Metlez) — Vne -pR = 0. (8.118) 


At this stage of the derivation it is now appropriate to linearize the continuity equation 
which becomes 
Ne ðuezı dne., Vo, xB _ 


+ Ned ĉ- 0. (8.119) 
oy. l dz dr B? l 
Substituting for @, using Eq.(8.101), and using Eq.(8.113) for uzı gives 


; K?KT.\ ged, [K?KT, ik T, 
ra (iwa 2K j+ en (= p y )=0 (8.120) 


VeiMe Te VeiMe qeBL 


which may be solved to give the sought-after electron density perturbation 


= ry (8.121) 


Here 

T= VeiMe/k?Te (8.122) 
is the nominal time required for electrons to diffuse a distance of the order of a parallel 
wavelength. This is because the parallel collisional diffusion coefficient scales as Dj ~ 
(random step)?/ (collision period) ~T./v-;me and according to the diffusion equation 
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the distance diffused in time t is given by z? ~ 4D\\t. Equation (8.121) shows that in the 
limit T} — 0, the electron density perturbation reverts to being Boltzmann, but for finite 
T|, a phase lag occurs between ne and ¢. 

The next step is to calculate the ion density perturbation. The ion equation of motion 
is 
du; 
dt 
because the ions are heavy, the left hand side inertial term must now be retained. The ions 
are assumed to be cold and the ion inertial term is assumed to be much larger than the 
collisional term so that the parallel component of the linearized Eq.(8.123) is 


= qi (-V¢+u; x B) — viem;(u; — ue); (8.123) 


Mi 


keGiby _ kee? qidi 
a= = s : 8.124 
eel WM; w KTe ( ) 


Again assuming that the inertial term is much larger than the collisional term, the linearized 
perpendicular ion equation becomes 


du; 
* dt 


= qi (—V¢ġı +u; x B) (8.125) 


which can be solved to give the perpendicular velocity as a sum of an E x B drift and a 
polarization drift, i.e., 


-VọxB m; d 
B: taB Os (8.126) 


u; = 


The polarization drift is retained in the ion but not the electron equation, because polariza- 
tion drift is proportional to mass. The perpendicular ion flux is 


—niVo xB MM; d 


Tr; = BR BaB ETA Vg) (8.127) 


which has a divergence 


V-D = Vn; - 


Vo xB Ge d 


77 BP di (v.0)) : (8.128) 


Substitution of Eq.(8.128) and (8.124) into the ion continuity equation, linearizing, and 
solving for the ion density perturbation gives 


22 
Wah ee E 
= T 


de kykTe MKT. 2 Gidy 
Nis w2 qi wqe LB q? B2 ” 


Te 
(8.129) 


= ke? qe w* pene qidy 
w? qi w AAE y 
where p? = KTe/m;w?; is a fictitious ion Larmor orbit defined using the electron instead 


of ion temperature. The fictitious length p2 is analogous to the fictitious velocity c2 = 
KTe/m; that occurred in the analysis of ion acoustic waves. 
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Since the plasma is quasi-neutral, the normalized electron and ion density perturbations 
must be the same. Equating the normalized density perturbations obtained from Eq.(8.121) 
and (8.129) gives 


ko. (8.130) 


For simplicity it is assumed here that q;/qe = —1, which is the situation for low tem- 
perature plasmas where the ions are too cold to be multiply ionized (q;/q. = —1 is, of 
course, always true for hydrogen plasmas). It is also assumed that the collision frequency 
is sufficiently small to have WT = WeiMe / k?Te << 1 and that w is of the order of 
w*; the self-consistency of these assumptions will be checked later. The left hand side of 
Eq.(8.130) can now be expanded using the binomial theorem to obtain the collisional drift 
wave dispersion relation 


* 2s 0) 
D(w, ky, ke) =1 Z Hkg — EE + iw — w')ry = 0. (8.131) 


This dispersion relation shows that drift waves have an association with ion acoustic waves 
since in the limit k?p? — 0, the real part of the dispersion becomes 


y 
* k2 2 
E -=8 =0 (8.132) 
W W 


which becomes an ion-acoustic wave in the limit of no equilibrium pressure gradient. Equa- 
tion (8.132) has two roots 
w* +4/(w*)? + 4k22 


- 1 
w 5 (8.133) 


which for small kc, are 
1. w = w* and w = —|k,c,| ifw* > 0 


2. w = w* and w = |kzcs| if w* < 0. 
Thus, the drift mode is a distinct mode compared to the ion acoustic wave and its parallel 
phase velocity is much faster than the ion acoustic wave since the drift wave occurs in the 
limit w/k; >> Cs. 

We now address the important question of the stability properties of Eq.(8.131). To 
do this, k, is assumed to be sufficiently small to have k,c, << w so that the dispersion 
describes the drift mode and not the ion acoustic mode. The real part of the dispersion is 
then set to zero to obtain the real part of the frequency 


* 


Ww 


ESTIRA 


(8.134) 
showing that the actual frequency of the collisional drift wave is smaller than w*, but as 
assumed is of the same order, provided k? p2 is not larger than order unity. The k? pr 
dependence results from ion polarization drift, an effect which was neglected in the initial 


simple model. The assumption wT) = Wei Me /k2T. << 1 implies kyVei/ k? Lwece << 1 
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which is true provided the ratio Vei/Wce is sufficiently small for a given geometric factor 
ky/ k2L. The imaginary part of the frequency is 


F mak 2,2, ,%* 
Di (w= w )T] a KPT] 
Wi = — =i) ———___ = ——— > 


/ ry* [32 3 
(OD, /Ow),—.., Ww [o3 (1 + k2p2) 


(8.135) 


and has several important features: 
1. w; is positive so collisional drift waves are always unstable. 


2. w; is proportional to 7) so that modes with the smallest k, and hence the fastest par- 
allel phase velocities are the most unstable, subject to the proviso that wr << | is 
maintained. This increase of growth rate with k7 ! or equivalently parallel wavelength 
means that drift waves typically have the longest possible parallel wavelength consis- 
tent with boundary conditions. For example, a linear plasma of finite axial extent with 
grounded conducting end walls has the boundary condition that ¢, = O at both end 
walls; the longest allowed parallel wavenumber is m/h where h is the axial length of 
the plasma (i.e., half a wavelength is the minimum number of waves that can be fitted 
subject to the boundary condition). 


3. Collisions make the wave unstable since T| ~ Vei. 


4. w; is proportional to the factor k3 p2/(A+ k3 p2)? which has its maximum value when 
kyp, is of order unity. 


5. w;is proportional to L~?; for a realistic cylindrical plasma (cf. Fig.8.3) the density 
is uniform near the axis and has a gradient near the edge (e.g., a Gaussian density 
profile where n(r) ~ exp(—r?/L7)). The density gradient is localized near the edge 
of the plasma and so drift waves should have the largest growth rate near the edge. 
Drift waves in real plasmas are typically observed to have maximum amplitude in the 
region of maximum density gradient. 

The free energy driving drift waves is the pressure gradient and so the drift waves might 
be expected tend to deplete their energy source by flattening out the pressure gradient. 
This indeed happens and in particular, the nonlinear behavior of drift waves reduces the 
pressure gradient. Drift waves pump plasma from regions of high pressure to regions of 
low pressure. This pumping can be calculated by considering the time-averaged, nonlinear, 
x-directed particle flux associated with drift waves, namely 


1 
Ts = (RenyReui,) = zke (n1Uj,)- (8.136) 


The species subscript ø has been omitted here, because to lowest order both species have 
the same uiz given by the Æ x B drift, i.e., 
. —Vġ xB . 
uir = ĉ- a = —ikyo,/B. (8.137) 
Being careful to remember that w* is a real quantity (this is a conventional, but confus- 
ing notation), Eqs.(8.121) and (8.137) are substituted into Eq.(8.136) to obtain the wave- 
induced particle flux 
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1 Nofe; y lhl? 
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(8.138) 
k?p? kilel? VeinoMe 
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Thus, it is seen that collisional drift waves cause an outward diffusion of plasma with a 
nonlinear wave-induced diffusion coefficient 


k2 2 k2 2 l 
Daj = yPs yll VeiMe (8.139) 
1+k2p2] 2B? kT, 


This diffusion coefficient has a proportionality similar to w;.The drift wave thus has the 
following properties: (i) the pressure gradient provides free energy and also an equilib- 
rium where drift waves are a normal mode, (ii) collisions allow the drift wave to feed on 
the available free energy and grow, (iii) nonlinear diffusion flattens the pressure gradient 
thereby depleting the free energy and also undermining the confining effect of the mag- 
netic field. A nonlinear saturated amplitude can result if some external source continuously 
replenishes the pressure gradient. 

It was shown earlier that the most unstable drift waves are those having k,p, of order 
unity. This property can be adapted to the more physically realistic situation of a cylindrical 
plasma by realizing that in a cylindrical plasma ky is replaced by m/r where m is the 
azimuthal mode number. Periodicity in 6 forces m to be an integer. If the most unstable 
m is a small integer, then the drift waves tend to be coherent, but if m is a large integer, 
many azimuthal wavelengths fit into the circumference of the cylinder. In this case, the 
periodicity condition is only a weak constraint and the waves typically become turbulent. 
Large m corresponds to small p, which occurs when the magnetic field is strong. Thus, 
plasmas with strong magnetic fields tend to have turbulent, short perpendicular wavelength 
drift waves, whereas plasmas with weak magnetic fields have coherent, long perpendicular 
wavelength drift waves. 

Drift wave turbulence has been found to be the main reason why the radial diffusion in 
tokamak magnetic plasma confinement devices is much worse than can be explained by 
simple random walks due to particle collisions. 


8.6.3 Vlasov theory of drift waves: collisionless drift waves 


Drift waves also exist when the plasma is so hot that the collision frequency becomes 
insignificant and the plasma can be considered collisionless. The previous section showed 
that a collision-induced phase lag between the density and potential fluctuations produces 
a destabilizing imaginary term in the dispersion relation. The analysis of collisionless 
plasma waves showed that Landau damping also causes a phase shift between the density 
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and potential fluctuations resulting in an imaginary term in the dispersion relation. Also, 
since Vlasov analysis showed that a relative motion between electrons and ions (i.e., a 
net current) destabilizes ion acoustic waves, it is reasonable to expect that current-driven 
destabilization could occur for drift waves. 

The Vlasov analysis of drift waves is less intuitive than the fluid analysis, but the reward 
for abstraction is a more profound model. As in the fluid analysis, the plasma is assumed to 
have a uniform equilibrium magnetic field B = B2, an x-directed density gradient given 
by Eq.(8.100), and a perturbed potential given by Eq.(8.101). In a sense the Vlasov analysis 
is simpler than the fluid analysis, because the Vlasov analysis involves a modest extension 
of the warm plasma electrostatic Vlasov model discussed in Section 8.1. As before, the 
perturbed distribution function is evaluated by integrating along the unperturbed orbits, 
i.e., 


t 
foi(X,V,t) = J dt! Fag (8.140) 
m 


Y o ðv E. 
The new feature here is that the equilibrium distribution function must incorporate the 
assumed density gradient. 

A first instinct would be to accomplish this by simply multiplying the Maxwellian dis- 
tribution function of Section 8.1 by the factor exp(—x/L) so that the assumed equilibrium 
distribution function would be f,o(x,v) = (muro) ?/? exp(—v?/v3,, — x/L). This ap- 
proach turns out to be wrong because f,o(x,v) = (muro)? exp(—v?/v2., — x/L) is 
not a function of the constants of the motion and so is not a solution of the equilibrium 
Vlasov equation. 

What is needed is some constant of the motion which includes the parameter x. The 
equilibrium distribution function could then be constructed from this constant of the motion 
and arranged to have the desired x—dependence. The appropriate constant of the motion is 
the canonical momentum in the y direction, namely 


Py = Movy + qo Åy = Mody + qo B-T = Ge +: z) qB- (8.141) 
co 
since B, = A, /ðx. Multiplying the original Maxwellian by the factor exp|— (vy /wco + 
x)/L] produces the desired spatial dependence while simultaneously satisfying the require- 
ment that the distribution function depends only on constants of the motion. Note that Py 
is a constant of the motion because y is an ignorable coordinate. 

It turns out that z-directed currents can also destabilize collisionless drift waves, much 
like currents provide free energy which can destabilize ion-acoustic waves. Since it takes 
little additional effort to include this possibility, a z-directed current will also be assumed 
so that electrons and ions are assumed to have distinct mean velocities u,,. In a frame 
moving with velocity w,, the species ø is thus assumed to have a distribution function 
~ exp[—mv??/2 — (vy/wee + )/L] where v’ = v—u,zZ is the velocity in the moving 
frame. This is a valid solution to the equilibrium Vlasov equation, since both the energy 
mv’? /2 and the y-direction canonical momentum (vy/wes + x)/L are constants of the 
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motion. The appropriately normalized lab frame distribution function is 


2 
Noo 5)2 VT 
foo(z,v) = Be exp | (v — tg22)" /v2., — (vy/Wee + x)/L (=) 


(8.142) 
where the normalization factor exp (- (ur/ 2weL)°) has been inserted so that the zeroth 


moment of fo gives the density. The necessity of this factor is made evident by completing 
the squares for the velocities and writing Eq.(8.142) in the equivalent form 


v y’ 
2 | | T | 2 
fev) = = (» i zz) esa 8.143 
OY) alas exp T ZI: (8.143) 


The term v,/wW-o which crept into Eq.(8.142) because of the seemingly abstract require- 
ment of having the distribution function depend on the constant of the motion Py cor- 
responds to the fluid theory equilibrium diamagnetic drift. This correspondence is easily 
seen by calculating the mean y-direction velocity (i.e. first moment of Eq.(8.142)) and find- 
ing that the v,/w-, term results in a fluid velocity u, which is precisely the diamagnetic 
velocity given by Eq.(8.98). 

We now insert Eq.(8.142) in Eq.(8.140) to calculate the perturbed distribution function. 
The unperturbed particle orbits are the same as in Section 8.1 because the concepts of both 
density gradient and axial current result from averaging over a distribution of many parti- 
cles and so have no meaning for an individual particle. The integration over unperturbed 
orbits is thus identical to that of Section 8.1, except that a different equilibrium distribution 
function is used. 

The essence of the calculation is in the term 


Ofoon  . ð foo 
Voz: Ov ps id, k- ðv 
(8.144) 
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which causes Eq.(8.140) to become 


a t 
f 1(x vV t) => _ Gobfoo dt' ek xt) iot" 2ik - M 2ikzUoz i iky 
o 45 Mo J cee V2, T WoL 
y 2e it! dik x(t’) 
E dob fco dt! vo dat! 
Meg 2ik,Ugz a iky eik-x(t!)—iwt! 
ae Ube Worl 
eA t 
2dobfo ike: me ee ee) 
= _ 2defoo ek x(t) lwt +i (w a keUgs — w3) J dt'e'* x(t ) iwt 
MoUTo 2 
(8.145) 
here y 
k ky «KL 
e ws (8.146) 


2wesL  qBL 7 
has been used as a generalization of Eq.(8.110). The rest of the analysis is as before and 
gives the dispersion relation 


1 (w kzUoz wž) = —A 
14 14 Z In(Ac °Z(Anog)| =O 147 
» kybo k-UTo DD ( Je (a ) (8 ) 


o n=— oo 


where as before A, = Rare and Ang = (w — Nnweco )/kzUTo. It is seen that the density 
gradient and the axial current both provide a Doppler shift in the a9, term. In both cases 
the Doppler shift is given by the wavenumber in the appropriate direction times the mean 
fluid velocity in that direction. 

Because Eq.(8.147) contains so much detailed physics in addition to the sought-after 
collisionless drift wave*, some effort and guidance is required to disentangle the drift wave 
information from all the other information. This is done by taking appropriate asymptotic 
limits of Eq.(8.147) and guidance is obtained using the results from the two-fluid analysis. 
These results showed that drift waves exist in the regime where: 


1. vri << w/k, << vre so that the ions are adiabatic and the electrons are isothermal, 
2. WWE << Wei, Wee, 


3. k? r? ~ 0 because the extremely small electron mass means that the electron Larmor 
orbit radius is negligible compared to the perpendicular wavelength even though the 
electrons are warm, 


4. k?r?; <1 since the ions were assumed cold, 


5. k?p ~1, 


6. Bens << 1 so that the waves are quasi-neutral, 


4Fq.(8.147) also describes Bernstein waves, the electrostatic limit of magnetized cold plasma waves, and mode 
conversion. 
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7. |uzil << |uze| since the parallel equilibrium velocities satisfy meue, + Muiz = 0; 
there is no parallel flow, just a parallel current. 
Because Ae ~ O, all electron terms in the summation over n vanish except for the 
n = Oterm. Since w/k, << vre, the small-argument limit of the plasma dispersion 
function is used for electrons and it is seen that the imaginary term is the dominant term 
for the electrons. In contrast since vr; << w/k,, the large-argument limit of the plasma 
dispersion function is used for the ions and also equilibrium parallel motion is neglected for 
ions because of the large ion mass. For the n 4 0 harmonics the terms Z7(a,)+Z(a_,) > 
—kvrill/(w — nw) + 1/(w + nweci)] and hence cancel each other since w << wei. On 
making these approximations, Eq.(8.147) reduces to 


Te -Ai We A |; 1/2 (w — kzUze — We) 
1+ T [1 — e^ Io(A;)] — eS (A) + in / eae: =0. (8.148) 
The real part of this dispersion gives 
AT Ai 
wr = er oy) (8.149) 
1+ = [1 — e^: Io(A;)] 
or in the limit of small A; 
1—4; 
Wp = BOD, es Tete 3 wl 
w sfa] (8.150) 


this corresponds to the fluid dispersion and moreover shows how finite ion temperature 
affects the dispersion. Using the method of Eq.(8.135), the imaginary part of the frequency 
is obtained from Eq.(8.148) as 


PORS w2r!/2 (wr = kzUze0 =e we) 
t as we Ai Io(A;) KUTe 
_ eH? (wi)? | A-Y) (T/T) + kettzeo ¥ [ery (8.151) 
k-UTe [1 + (1 = YT) els 


where Y = e+ Jg(A;). In the limit A; — 0, it is seen that Y — 1 — A; so that the 
imaginary part of the frequency becomes 


me). 


Wi = 

kzVTe [1 + k? p2] 
The imaginary part of the frequency is therefore always positive so there is always in- 
stability. Two collisionless destabilization mechanisms are seen to exist, normal Landau 
damping represented by the term involving k? (p? +r? ;) in the curly brackets, and current, 
represented by k,u,9/w. Since collisions can also destabilize drift waves there are at least 
three mechanisms by which drift waves can be destabilized. Comparison of Eq.(8.152) 
with Eq.(8.135) shows which of these mechanisms will be dominant in a given plasma. 
Modes with long parallel wavelengths are the most unstable for drift waves driven unstable 
by Landau damping, just as for collisional drift waves. 


5 {ki (p2 + rii) + kettzeo/w}. (8.152) 
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There are many more varieties of drift waves besides the basic versions discussed here. 


The common feature is that the gradient of pressure perpendicular to the magnetic field 
provides a new mode and free energy from the pressure gradient or from parallel current 
can be tapped by this mode so that the mode is spontaneously unstable. 


8.7 Assignments 


. Electrostatic ion cyclotron waves: Using the electrostatic hot plasma dispersion rela- 
tion show that there exists a mode in the vicinity of the ion cyclotron frequency having 
the dispersion 
E 22 
Ww = We +ki ec (8.153) 
Show that this mode can be driven unstable by an axial current. 


. Mode conversion: Work through the algebra of the linear mode conversion problem 
and plot the contours of Ref (p) for both positive and negative £. Hint: It is easier to 
plot the contours of f (p) if one uses polar coordinates in the complex p plane so that 
p = re? . Then the real part of the term p? is just Re(r?e**?) = r3 cos(30). Choose 
a set of contour paths for € < 0 which satisfy the boundary condition that there are no 
modes growing exponentially with increasing distance from € = 0. Then determine 
which combination of these contours does not give an inward propagating hot plasma 
wave when € > 0 (this was another boundary condition). 


3. Prove that the distribution function 


2 
Noc 5\2 2 | VE 
foolz, v) = Ts, exp (v Ug x2) | 5 (Vy /Weo T x)/L (=) | 


provides a mean drift in the y direction corresponding to the diamagnetic drift. 


MHD equilibria 


9.1 Why use MHD? 


Of the three levels of plasma description — Vlasov, two-fluid, and MHD — Vlasov is the 
most accurate and MHD is the least accurate. So, why use MHD? The answer is that 
because MHD is a more macroscopic point of view, it is more efficient to use MHD in 
situations where the greater detail and accuracy of the Vlasov or two-fluid models are 
unnecessary. MHD is particularly suitable for situations having complex geometry because 
it is very difficult to model such situations using the microscopically oriented Vlasov or 
2-fluid approaches and because geometrical complexities are often most important at the 
MHD level of description. The equilibrium and gross stability of three-dimensional, finite- 
extent plasma configurations are typically analyzed using MHD. Issues requiring a 2-fluid 
or a Vlasov point of view can exist and be important, but these more subtle questions can 
be addressed after an approximate understanding has first been achieved using MHD. The 
MHD point of view is especially relevant to situations where magnetic forces are used 
to confine or accelerate plasmas. Examples of such situations include magnetic fusion 
confinement plasmas, solar and astrophysical plasmas, planetary and stellar dynamos, arcs, 
and magnetoplasmadynamic thrusters. Although molten metals are not plasmas, they are 
described by MHD and in fact, the MHD description is actually more appropriate and more 
accurate for molten metals than it is for plasmas. 

We begin our discussion by examining certain general properties of magnetic fields in 
order to develop an intuitive understanding of the various stresses governing MHD equilib- 
rium and stability. The MHD equation of motion, 


aU 
p É J uvu] =J xB-VP, (9.1) 


is a generalization of the equation of motion for an ordinary fluid because it includes the 
J x B magnetic force. Plasma viscosity is normally very small and is usually omitted 
from the MHD equation of motion. However, when torques exist, a viscous damping term 
needs to be included if one wishes to consider equilibria. This is because such a damping is 
required to balance any torque; otherwise the plasma will spin up without limit. Such situ- 
ations will be discussed in Section 9.9; until then, viscosity will assumed to be negligible 
and will be omitted from Eq.(9.1). 
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9.2 Vacuum magnetic fields 


The simplest non-trivial magnetic field results when the magnetic field is produced by 
electric currents located outside the volume of interest and there are no currents in the 
volume of interest. This type of magnetic field is called a vacuum field since it could exist 
in a vacuum. Because there are no local currents, a vacuum field satisfies 


V x Byac=0. (9.2) 


Since the curl of a gradient is always zero, a vacuum field must be the gradient of some 
scalar potential 4%, i.e., the vacuum field can always be expressed as Byae = Vw. For 
this reason vacuum magnetic fields are also called potential magnetic fields. Because all 
magnetic fields must satisfy V - B =0, the potential ~ satisfies Laplace’s equation, 

V2 =0. (9.3) 
Hence the entire mathematical theory of vacuum electrostatic fields can be brought into 
play when studying vacuum magnetic fields. Vacuum electrostatic theory shows that if ei- 
ther w or its normal derivative is specified on the surface S bounding a volume V, then 
w is uniquely determined in V. Also, if an equilibrium configuration has symmetry in 
some direction so that the coefficients of the relevant linearized partial differential equa- 
tions do not depend on this direction, the linearized equations may be Fourier transformed 
in this ‘ignorable’ direction. Vacuum is automatically symmetric in all directions and Pois- 
son’s equation reduces to Laplace’s equation which is intrinsically linear. The linearity and 
symmetry causes Laplace’s equation to reduce to one of the standard equations of math- 
ematical physics. For example, consider a cylindrical configuration with coordinates r, 0, 
and z and suppose that this configuration is axially and azimuthally uniform so that both 6 
and z are ignorable coordinates; i.e., the coefficients of the partial differential equation do 
not depend on 0 or on z. Fourier analysis of Eq.(9.3) implies that ~ can be expressed as 
the linear superposition of modes varying as exp(im6 + ikz). For each choice of m and k, 
Eq. (9.3) becomes 


yY 1 m? 2\ , 

BP 7 Or 72 tk )w=0. (9.4) 
Defining s = kr, this can be recast as 

yY 1dv m\ | 

982 H > Ds 14 oe os 0, (9.5) 


a modified Bessel’s equation. The solutions of Eq.(9.5) are the modified Bessel functions 
In(kr) and Km(kr). Thus, the general solution of Laplace’s equation here is 


w(r, 6, z) es fi am(k)Im(kr) + bm (k) Km (kr)] €+ dk (9.6) 
m=- CoO 
where the coefficients am (k), bm (k) are determined by specifying either w or its normal 
derivative on the bounding surface. Analogous solutions can be found in geometries having 
other symmetries. 

This behavior can be viewed in a more general way. Equation (9.3) states that the sum 
of partial second derivatives in two or three different directions is zero, so at least one of 
these terms must be negative and at least one term must be positive. Since negative Y” /ọ 
corresponds to oscillatory (harmonic) behavior and positive Y” /~ corresponds to exponen- 
tial (non-harmonic) behavior, any solution of Laplace’s equation must be oscillatory in one 
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or two directions (the 0 and z directions for the cylindrical example here), and exponen- 
tially growing or decaying in the remaining direction or directions (the r direction for the 
cylindrical example here). 

Non-vacuum magnetic fields are more complicated than vacuum fields, and unlike vac- 
uum fields, are not uniquely determined by the surface boundary conditions. This is be- 
cause non-vacuum fields are determined by both the current distribution within the volume 
and the surface boundary conditions. Vacuum fields are distinguished from non-vacuum 
fields because vacuum fields are the lowest energy fields satisfying given boundary condi- 
tions on the surface S of a volume V. Let us now prove this statement. 

Consider a volume V bounded by a surface S over which boundary conditions are 
specified. Let Bmin(r) be the magnetic field having the lowest stored magnetic energy of 
all possible magnetic fields satisfying the prescribed boundary conditions. We use methods 
of variational calculus to prove that this lowest energy field is the vacuum field. 


| 
B min 


Figure 9.1: Magnetic field energy for different configurations, Bmin is the configuration 
with minimum magnetic energy for a given boundary condition. 


Consider some slightly different field denoted as B(r) which satisfies the same bound- 
ary conditions as Bmin. This slightly different field can be expressed as B(r) = Bmin(r) + 
ôB (r) where ôB (r) is a small arbitrary variation about Bmin.- This situation is sketched in a 
qualitative fashion in Fig.9.1 where the horizontal axis represents a continuum of different 
allowed choices for the vector function B(r). Since B(r) satisfies the same boundary con- 
ditions as Bmin, 6B(r) must vanish on S. The magnetic energy W associated with B(r) 
can be evaluated as 


Qp49W | (Bmin + 5B)? dër 


t 
£ if B2,,d°r + 2 | ôB - Bmind?r + f (86B)?° d?r. (9.7) 
V yY V 


If the middle term does not vanish, 6B could always be chosen to be antiparallel to Bmin 
inside V so that ôB - Bmin would be negative. Since ôB is assumed small, ôB - Bmin 
would be larger in magnitude than (ôB}’. Such a choice for 6B would make W lower than 
the energy of the assumed lowest energy field, thus contradicting the assumption that Bmin 


9.2 Vacuum magnetic fields 267 


is in fact the lowest energy field. Thus, the only way to ensure that B,,;, is indeed the true 
minimum is to require 


6B-Bmyind?r = 0 (9.8) 


no matter how 6B is chosen. Using 6B =V x 6A and the vector identity V - (A x B) = 
B-V x A— A-VxB, Eq. (9.8) can be integrated by parts to obtain 


| [V - (6A x Bmin) +5A-V x Bmin] d?r = 0. (9.9) 
V 


The first term can be transformed into a surface integral over S using Gauss’s theorem. 
This surface integral vanishes because 6A must vanish on the bounding surface (recall that 
the variation satisfies the same boundary condition as the minimum energy field). Because 
ôB is arbitrary within V, 6A must also be arbitrary within V and so the only way for the 
second term in Eq. (9.9) to vanish is to have V xBmin = 0. Thus, Bmin must be a vacuum 
field. 

An important corollary is as follows: Suppose boundary conditions are specified on the 
surface enclosing some volume. These boundary conditions can be considered as “rules” 
which must be satisfied by any solution to the equations. All configurations satisfying the 
imposed boundary condition and having finite current within the volume are not in the 
lowest energy state. Thus, non-vacuum fields can, in principle, have free energy available 
for driving boundary-condition-preserving instabilities. 


magnetic 
energy 


vacuum field 


force-free field (typical) 


KN 


configuration 


Figure 9.2: Sketch of magnetic energy dependence on configuration of a system with fixed, 
specified boundary conditions. The variation of the configuration would correspond to 
different internal current profiles. The force-free configurations are local energy minima 
while the vacuum configuration has the absolute lowest minimum. 
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9.3 Force-free fields 


Although the vacuum field is the lowest energy configuration satisfying prescribed bound- 
ary conditions, non-vacuum configurations do not always ‘decay’ to this lowest energy 
state. This is because there also exists a family of higher energy configurations to which 
the system may decay; these are the so-called force-free states. The current is not zero in 
a force-free state but the magnetic force is zero because the current density is everywhere 
parallel to the magnetic field. Thus J x B vanishes even though both J and B are finite. 
If a plasma initially in a non-force-free state evolves toward a force-free state it becomes 
‘stuck’ in the force-free state because there are no forces acting to change the equilibrium. 
The magnetic energy of a force-free field is not the absolute minimum energy for the spec- 
ified boundary conditions, but it is a local minimum in configuration space as sketched in 
Fig.9.2. This hierarchy of states is somewhat analogous to the states of a quantum system 
— the vacuum field is the analog of the ground state and the force-free states are the analogs 
of higher energy quantum states. 


magnetic field 
due to first wire 


force due to magnetic field 
of second wire acting on 
current of first wire 


second wire 


force due to magnetic field 
of first wire acting on 
current of second wire 


Figure 9.3: Parallel currents attract each other 


9.4 Magnetic pressure and tension 


Much useful insight can be obtained by considering the force between two parallel current- 
carrying wires as shown in Fig.9.3. Calculation of the field B observed at one wire due to 
current in the other shows that the J x B force is such as to push the wires together; i.e., 
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parallel currents attract each other. Conversely, anti-parallel currents repel each other. 

A bundle of parallel wires as shown in Fig. 9.4 will therefore mutually attract each 
other resulting in an effective net force which acts to reduce the diameter of the bundle. 
The bundle could be replaced by a distributed current such as the current carried by a 
finite-radius, cylindrical plasma. This contracting, inward directed force is called the pinch 
force or the pinch effect. 

The pinch force may be imagined as being due to a ‘tension’ in the azimuthal magnetic 
field which wraps around the distributed current. Extending this metaphor, the azimuthal 
magnetic field is visualized as acting like an ‘elastic band’ which encircles the distributed 
current and squeezes or pinches the current to a smaller diameter. This concept is consistent 
with the situation of two permanent magnets attracting each other when the respective north 
and south poles face each other. The magnetic field lines go from the north pole of one 
magnet to the south pole of the other and so one can pretend that the attraction of the two 
magnets is due to tension in the field lines spanning the gap between the two magnets. 


bundle of parallel wires 
coming out of paper 


radial inward 
‘pinch’ force 


Figure 9.4: Bundle of currents attract each other, giving effective radial inward pinch force. 


Now consider a current-carrying loop such as is shown in Fig.9.5. Because the currents 
on opposite sides of the loop flow in opposing directions, there will be a repulsive force 
between the current element at each point and the current element on the opposite side of 
the loop. The net result is a force directed to expand the diameter of the loop. This force 
is called the hoop force or hoop stress. Hoop force may also be interpreted metaphorically 
by introducing the concept of magnetic pressure. As shown in Fig.9.5, the magnetic field 
lines linking the current are more dense inside the loop than outside, a purely geometrical 
effect resulting from the curvature of the current path. Since magnetic field strength is 
proportional to field line density, the magnetic field is stronger on the inside of the loop 
than on the outside, i.e., B? is stronger on the inside than on the outside. The magnetic 
field in the plane of the loop is normal to the plane and so one can explain the hoop force 
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by assigning a magnetic ‘pressure’ proportional to B? acting in the direction perpendicular 
to B. Because B? is stronger on the inside of the loop than on the outside, there is a 
larger magnetic pressure on the inside. The outward force due to this pressure imbalance is 
consistent with the hoop force. 
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Figure 9.5: Perspective and top view of magnetic fields generated by a current loop. 


The combined effects of magnetic pressure and tension can be visualized by imagining 
a current-carrying loop where the conductor has a finite diameter. Suppose that this loop 
initially has the relative proportions of an automobile tire, as shown in Fig.9.6. The hoop 
force will cause the diameter of the loop to increase, while the pinch force will cause the 
diameter of the conductor to decrease. The net result is that these combined forces will 
cause the automobile tire to evolve towards the relative proportions of a bicycle tire (large 
major radius, small minor radius). 


Figure 9.6: Magnetic forces act to transform a fat small current loop (auto tire) into a large 
skinny loop (bicycle tire) 
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9.5 Magnetic stress tensor 


The existence of magnetic pressure and tension shows that the magnetic force is differ- 
ent in different directions, and so the magnetic force ought to be characterized by an 
anisotropic stress tensor. To establish this mathematically, the vector identity VB?/2 = 
B-VB + BxV xB is invoked so that the magnetic force can be expressed as 


JIxB = —(VxB)xB 
Ho 
2 
a | v(=) -B-VB]| 
Ho 2 
2 
= sity [$ 1- sB] (9.10) 
Ho 2 


where I is the unit tensor and the relation V - (BB) = (V - B) B + B-VB = B-VB has 
been used. At any point r a local Cartesian coordinate system can be defined with z axis 
parallel to the local value of B so that Eq.(9.1) can be written as 


aU 2 
|S tuU] =-V- pees (9.11) 


showing again that the magnetic field acts like a pressure in the directions transverse to B 
(i.e., x,y directions in the local Cartesian system) and like a tension in the direction parallel 
to B. 

While the above interpretation is certainly useful, it can be somewhat misleading be- 
cause it might be interpreted as implying the existence of a force in the direction of B 
when in fact no such force exists because J x B clearly does not have a component in the 
B direction. A more accurate way to visualize the relation between magnetic pressure and 
tension is to rearrange the second line of Eq.(9.10) as 


1 B2 Aae A B? 1 B? 
JxB= | v( ) BYE + BBV ( Ne | vı ( ) + Be 
Ho 2 2 Ho 2 


(9.12) 
or 2 
1 B , 
JxB= | vı ( ) i aa ‘ (9.13) 
Ho 2 
Here . 
=. V= A (9.14) 
R= =O : 


is a measure of the curvature of the magnetic field at a selected point on a field line and, in 
particular, R. is the local radius of curvature vector. The vector R goes from the center of 
curvature to the selected point on the field line. The « term in Eq.(9.13) describes a force 
which tends to straighten out magnetic curvature and is a more precise way for character- 
izing field line tension (recall that tension similarly acts to straighten out curvature). The 
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term involving V B? portrays a magnetic force due to pressure gradients perpendicular 
to the magnetic field and is a more precise expression of the hoop force. 

In our earlier discussion it was shown that the vacuum magnetic field is the lowest 
energy state of all fields satisfying prescribed boundary conditions. A vacuum field might 
be curved for certain boundary conditions (e.g., a permanent magnet with finite dimensions) 
and, in such a case, the two terms in Eq.(9.13) are both finite but exactly cancel each 
other. We can think of the minimum-energy state for given boundary conditions as being 
analogous to the equilibrium state of a system of stiff rubber hoses which have been pre- 
formed into shapes having the morphology of the vacuum magnetic field and which have 
their ends fixed at the bounding surface. Currents will cause the morphology of this system 
to deviate from the equilibrium state, but since any deformation requires work to be done 
on the system, currents invariably cause the system to be in a higher energy state. 


9.6 Flux preservation, energy minimization, and 
inductance 


Another useful way of understanding magnetic field behavior relates to the concept of 
electric circuit inductance. The self-inductance L of a circuit component is defined as 
the magnetic flux ® linking the component divided by the current J flowing through the 


component, 1.e. 


p 
Pos: (9.15) 


Consider an arbitrary short-circuited coil with current J located in an infinite volume V 
and let C denote the three-dimensional spatial contour traced out by the wire constituting 
the coil. The total magnetic flux linked by the turns of the coil can be expressed as 


b= |Bas= | VxAds= $ A-dl (9.16) 
C 


where the surface integral is over the area elements linked by the coil turns. The energy 
contained in the magnetic field produced by the coil is 


B2 
W = f — g?r 
v 2Ho 
1 
= — | B-VxAd?r. (9.17) 
2ho Jv 


However, using the vector identity V - (A x B) = B-VxA — A-VxB this magnetic 
energy can be expressed as 


W = as A-VxBdr + z | ds- A x B (9.18) 
2Ho Jv 2Ho Js- 

where Gauss’s law has been invoked to obtain the second term, an integral over the surface 

at infinity Soo. This surface integral vanishes because (i) at infinity the magnetic field must 

fall off at least as fast as a dipole, i.e., B ~ R? where R is the distance to the origin, 

(ii) the vector potential magnitude A scales as the integral of B so A ~ R~?, and (iii) the 

surface at infinity scales as R?. 
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Using Ampere’s law, Eq.(9.18) can thus be rewritten as 


1 
W= F) A - Jar. (9.19) 
2 V 


However, J is only finite in the coil wire and so the integral reduces to an integral over the 
volume of the wire. A volume element of wire can be expressed as d?r = ds-dl where dl 
is an element of length along the wire, and ds is the cross-sectional area of the wire. Since 
J and dl are parallel, they can be interchanged in Eq.(9.19) which becomes 


W = = A-dl J-ds 


1 
= gL (9.20) 


where J-ds =/ is the current flowing through the wire. Thus, the energy stored in the 
magnetic field produced by a coil is just the inductive energy of the coil. 

If the coil is perfectly short-circuited, then it must be flux conserving, for if there were 
a change in flux, a voltage would appear across the ends of the coil. A closed current 
flowing in a perfectly conducting plasma is thus equivalent to a short-circuited current- 
carrying coil and so the perfectly conducting plasma can be considered as a flux-conserver. 
If flux is conserved, i.e. P = const., the second from last line in Eq.(9.20) shows that the 
magnetic energy of the system will be lowered by any rearrangement of circuit topology 
that increases self-inductance. 

Hot plasmas are reasonably good flux conservers because of their high electrical con- 
ductivity. Thus, any inductance-increasing change in the topology of plasma currents will 
release free energy which could be used to drive an instability. Since forces act so as to re- 
duce the potential energy of a system, magnetic forces due to current flowing in a plasma 
will always act so as to increase the self-inductance of the configuration. One can therefore 
write the force F due to a flux-conserving change in inductance L as 


2 
F=-—v (=). (9.21) 


The pinch force is consistent with this interpretation since the inductance of a conductor 
depends inversely on its radius. The hoop force is also consistent with this interpretation 
since inductance of a current loop increases with the major radius of the loop. More 
complicated behavior can also be explained, especially the kink instability to be discussed 
later. In the kink instability, current initially flowing in a straight line develops an instability 
that causes the current path to become helical. Since a coil (helix) has more inductance than 
a straight length of the same wire, the effect of the kink instability also acts so as to increase 
the circuit self-inductance. 
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9.7 Static versus dynamic equilibria 


We define (i) a static equilibrium to be a time-independent solution to Eq.(9.1) having no 
flow velocity and (ii) a dynamic equilibrium as a solution with steady-state flow velocities. 
Thus U = 0 for a static equilibrium whereas U is finite and steady-state for a dynamic 
equilibrium. For a static equilibrium the MHD equation of motion reduces to 


VP=J xB. (9.22) 


For a dynamic equilibrium the MHD equation of motion reduces to 


pU-VU+VP =J x B+upV?U (9.23) 


where the last term represents a viscous damping and v is the kinematic viscosity. By tak- 
ing the curl of these last two equations we see that for static equilibria the magnetic force 
must be conservative i.e., V x (J x B) = 0 whereas for dynamic equilibria the magnetic 
force is typically not conservative since in general V x (J x B) Æ 0. Thus, the charac- 
ter of the magnetic field is quite different for the two cases. Static equilibria are relevant to 
plasma confinement devices such as tokamaks, stellarators, reversed field pinches, spher- 
omaks while dynamic equilibria are mainly relevant to arcs, jets, magnetoplasmadynamic 
thrusters, but can also be relevant to tokamaks, etc., if there are flows. Both static and 
dynamic equilibria occur in space plasmas. 


cylindrical plasma with 
axial current density J-(r) 


hydrodynamic pressure pushes out 
magnetic force pinches in 


finite pressure 
in cylinder 


Figure 9.7: Geometry of Bennett pinch. 
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9.8.1 Static equilibria in two dimensions: the Bennett pinch 


The simplest static equilibrium was first investigated by Bennett (1934) and is called the 
Bennett pinch or z-pinch (here z refers to the direction of the current). This configuration, 
sketched in Fig.9.7, consists of an infinitely long axisymmetric cylindrical plasma with 
axial current density J, = J,(r) and no other currents. 

The axial current flowing within a circle of radius r is 


I(r)= K 2ar'J,(r')dr' (9.24) 
0 


and the axial current density is related to this integrated current by 


J,(r) = — —. 2 
a 2rr Or (aa 
Since Ampere’s law gives 
I 
Bo(r) = O., (9.26) 
f 2mr 
Eq.(9.22) can be written 
OP ar 
PSIA pA OS (9.27) 


. Or 8r? Or, : Pe 
Integrating Eq. (9.27) from r = 0 to r = a, where a is the outer radius of the cylindrical 


plasma, gives the relation 


“ ,0P a j Hol? (a) 
2 — [2 _ _ Mo 
f m o ie [r?P(r)], — 2 f rP(r)dr = — e (9.28) 
The integrated term vanishes at both r = 0 and r = a since P(a) = 0 by definition. If the 
temperature is uniform, the pressure can be expressed as P(r) = n(r)KT and so Eq.(9.28) 
can be expressed as 


P= 8TNKT 
Ho 

where N = ie n(r)2mrdr is the number of particles per axial length. Equation (9.29), 
called the Bennett relation, shows that the current required to confine a given N and T is 
independent of the details of the internal density profile. This relation describes the sim- 
plest non-trivial MHD equilibrium and suggests that quite modest currents could contain 
substantial plasma pressures. This relation motivated the design of early magnetic fusion 
confinement devices but, as will be seen, it is overly optimistic because simple z-pinch 
equilibria turn out to be highly unstable. 

Confinement using currents that flow in the azimuthal direction is also possible, but 
this configuration, known as a 0-pinch, is fundamentally transient. In a -pinch a rapidly 
changing azimuthal current in a coil going around a cylindrical plasma creates a transient 
B, field as shown in Fig. 9.8. Because the conducting plasma conserves magnetic flux, 
the transient B, field cannot penetrate the plasma and so is confined to the vacuum region 
between the plasma and the coil. This exclusion of the B, field by the plasma requires 


(9.29) 
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the existence in the plasma surface of an induced azimuthal current which creates a mag- 
netic field in the plasma interior that exactly cancels the coil-produced transient B, field. 
The radial confining force results from a radially inward force J x B =J x B,2 where 
B, is the magnetic field associated with the current in the coil and Jọ is the plasma surface 
current. An alternative but equivalent point of view is to invoke the concept that opposite 
currents repel and argue that the azimuthal current in the coil repels the oppositely directed 
azimuthal current in the plasma surface, thereby pushing the plasma inwards and so bal- 
ancing the outwards force due to plasma pressure. The -pinch configuration is necessarily 
transient, because the induced surface current cannot be sustained in steady-state. 


er coil current 


plasma surface current 


Figure 9.8: Theta pinch configuration. Rapidly changing azimuthal coil current induces 
equal and opposite azimuthal current in plasma surface. Pressure of B, between these two 
currents pushes in on plasma and provides confinement. 


9.8.2 Impossibility of self-confinement of current-carrying plasma in three 
dimensions: the virial theorem 


The Bennett analysis showed that axial currents flowing in an infinitely long cylindrical 
plasma generate a pinch force which confines a finite pressure plasma. The inward pinch 
force balances the outward force associated with the pressure gradient. The question now is 
whether this two-dimensional result can be extended to three dimensions; i.e., is it possible 
to have a finite-pressure three-dimensional plasma, as shown in Fig.9.9, which is confined 
entirely by currents circulating within the plasma? To be more specific, is it possible to 
have a finite-radius plasma sphere surrounded by vacuum where the confinement of the 
finite plasma pressure is entirely provided by the magnetic force of currents circulating in 
the plasma; i.e., can the plasma hold itself together by its own ‘bootstraps’? The answer, 
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a resounding ‘no’, is provided by a virial theorem due to Shafranov (1966). A virial is 
a suitably weighted integral over the entirety of a system and contains information about 
how an extensive property such as energy is partitioned in the system. For example, in 
mechanics the virial can be the time average of the potential or of the kinetic energy. 


ball of plasma 
surrounded by vacuum 


Figure 9.9: Plasma sphere with finite pressure surrounded by vacuum 


The MHD virial theorem is obtained by supposing that a self-confining configuration 
exists and then showing this leads to a contradiction. 
We therefore postulate the existence of a spherical plasma with the following properties: 
1. the plasma has finite radius a and is surrounded by vacuum, 


2. the plasma is in static MHD equilibrium, 


3. the plasma has finite internal pressure and the pressure gradient is entirely balanced 
by magnetic forces due to currents circulating in the plasma; i.e. there are no currents 
in the surrounding vacuum region. 

Using Eqs.(9.10) and (9.11) the static MHD equilibrium can be expressed as 


V-T=0 (9.30) 
where the tensor T is defined as 
B? 1 
T=(P+Ž-)1- BB. (9.31) 
20 Ho 


Let r =x%+yy-+ zZ be the vector from the center of the plasma to the point of observation. 
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Consider the virial expression 
V- (Tr) = Sse (Ger) 
(T-r) = (Tar 
Ox jk®k 


(9.32) 
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where Trace T =X jk Tjkôjk. From Eq.(9.31) (or, equivalently from the matrix form in 
Eq.(9.11)) it is seen that 

Trace T =3P + B?/2t1 
is positive definite. 

We now integrate both sides of Eq.(9.32) over all space. Since the right hand side of 
Eq.(9.32) is positive definite, the integral of the right hand side over all space is finite and 
positive. The integral of the left hand side can be transformed to a surface integral at infinity 
using Gauss’ theorem 


JET» = Í ds-T-r 
Soo 
B? 1 
J as (P+ 5) 1- BB T. 
Soo 210 Ho 


Since the plasma is assumed to have finite extent, P is zero on the surface at infinity. 
The magnetic field can be expanded in multipoles with the lowest order multipole being 
a dipole. The magnetic field of a dipole scales as r~* for large r while the surface area 
J ds scales as r?. Thus the left hand side term scales as f dsB?r~r~? and so vanishes 
as r—oo. This is in contradiction to the right hand side being positive definite and so the 
set of initial assumptions must be erroneous. Thus, any finite extent, three-dimensional, 
static plasma equilibrium must involve at least some currents external to the plasma. A 
finite extent, three-dimensional static plasma can therefore only be in equilibrium if at least 
some of the magnetic field is produced by currents in coils that are external to the plasma 
and that are held in place by some mechanical structure. If the coils were not supported 
by a mechanical structure, then the current in the coils could be considered as part of the 
MHD plasma and the virial theorem would be violated. In summary, a finite extent three- 
dimensional plasma in static equilibrium with a finite internal hydrodynamic pressure P 
must ultimately have some tangible exterior object to “push against”. The buttressing is 
provided by magnetic forces acting between currents in external coils and currents in the 
plasma. 


(9.33) 


9.8.3 Three-dimensional static equilibria: the Grad-Shafranov equation 


Despite the simple appearance of Eq.(9.22), its three-dimensional solution is far from triv- 
ial. Before even attempting to find a solution, it is important to decide the appropriate way 
to pose the problem, i.e., it must be decided which quantities are prescribed and which are 
to be solved for. For example, one might imagine prescribing a pressure profile P(r) and 
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then using Eq.(9.22) to determine a corresponding B(r) with associated current J (r); alter- 
natively one could imagine prescribing B(r) and then using Eq.(9.22) to determine P(r). 
Unfortunately, neither of these approaches work in general because solutions to Eq.(9.22) 
only exist for a very limited set of functions. 

The reason why an arbitrary magnetic field cannot be specified is that V x VP = 0 
is always true by virtue of a mathematical identity whereas V x (J x B) = 0 is true 
only for certain types of B(r). It is also true that an arbitrary equilibrium pressure profile 
P(r) cannot be prescribed (see assignment #2 where it is demonstrated that no J x B 
exists which can confine a spherically symmetric pressure profile). Equilibria thus exist 
only for certain specific situations, and these situations typically require symmetry in some 
direction. We shall now examine a very important example, namely static equilibria which 
are azimuthally symmetric about an axis (typically defined as the z axis). This symmetry 
applies to a wide variety of magnetic confinement devices used in magnetic fusion research, 
for example tokamaks, reversed field pinches, spheromaks, and field reversed theta pinches. 

We start the analysis by assuming azimuthal symmetry about the z axis of a cylindri- 
cal coordinate system r, , z so that any physical quantity f has the property 3f /O¢ = 0. 
Because of the identity V x V@ = 0, algebraic manipulations become considerably sim- 
plified if vectors in the ¢ direction are expressed in terms of Vo = / r rather than in terms 
of @. As sketched in Fig.9.10 the term toroidal denotes vectors in the ¢ direction (long way 
around a torus) and the term poloidal denotes vectors in the r — z plane. 


poloidal 
vector 


toroidal 
vector 


Figure 9.10: Toroidal vectors and poloidal vectors. 


The most general form for an axisymmetric magnetic field is 


1 
B =— 
an | 


w(r, z) is called the poloidal flux and I(r, z) is the current linked by a circle of radius r 
with center on the axis at axial location z. The toroidal magnetic field then is 
Hol oy — Hol} 


a VOR Se (9.35) 


Vy x Vo + IVe). (9.34) 


Btor = Boo = 
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showing that the functional form of Eq.(9.34) is consistent with Ampere’s law $ B-dl =pig/. 
The poloidal magnetic field is 


Bpo = — (Vv x V¢). (9.36) 


1 
20 ( 
Integration of the poloidal magnetic field over the area of a circle of radius r with center at 
axial location z gives 


T y 1 
[ Bpoi - ds = | ave x Vo- 22rr'dr' = wr, z); (9.37) 
0 o 40 


thus y(r, z) is the poloidal flux at location r, z. The concept of poloidal flux depends on 
the existence of axisymmetry so that a circle of radius r can always be associated with a 
location r, z. 

Axisymmetry also provides a useful relationship between toroidal and poloidal vectors. 
In particular, the curl of a toroidal vector is poloidal since 


Y x Bior = fey x Vo (9.38) 
and similarly the curl of a poloidal vector is toroidal since 
~ (OB, OB, 
V x Boo = V x (Brf + B22) = - — ]. ; 
x Bpol x (Brf + B,Z) (F =) (9.39) 


The curl of the poloidal magnetic field is a Laplacian-like operator on w since 


1 1 ico 
Vo: V x Boot = V- (Bpo X Ve) = V- & [Vy x Vo] x ve) = Soa (avv) , 
(9.40) 
a relationship established using the vector identity V - (F x G) = G-V x F—F-V xG. 
Because V x Bpo; is purely toroidal and ¢ = rV¢, one can write 
Vx Bool = ee 


2 1 
FA (avv) Vo. (9.41) 


Ampere’s law states that V x B =p) J. Thus, from Eqs. (9.38) and (9.41) the respective 
toroidal and poloidal currents are 


r? 1 : 
Jior = — V. -3 VY Vo (9.42) 
r 
and 1 
Jpol = ZVI x VO. (9.43) 
2T 


We are now in a position to evaluate the magnetic force in Eq.(9.22). After decomposing 
the magnet field and current into toroidal and poloidal components, Eq.(9.22) becomes 


VP = J poi x Bror + Jtor X Bool + J poi x Boo. (9.44) 


The Joi X Byoi term is in the toroidal direction and is the only toroidally directed term on 
the right hand side of the equation. However 0P/0¢ = 0 because all physical quantities 
are independent of o and so Jp X Bpo; must vanish. This implies 


(VI x Vo) x (Vy x Ve) =0 (9.45) 
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which further implies that VJ must be parallel to V7. An arbitrary displacement dr results 
in respective changes in current and poloidal flux dJ = dr-VJ and dw = dr-Vw so that 


dl dr-VI_ 
dy dr-Vq’ 


since VI is parallel to Vw, the derivative dI/dy is always defined. Thus J must be a 
function of w and it is always possible to write 


VI(Y) = T'(4) Vo (9.46) 


where prime means derivative with respect to the argument. The poloidal current can there- 
fore be expressed in terms of the poloidal flux function as 


[' 
Spot = VU x Vo. (9.47) 
2T 


Substitution for the currents and magnetic fields in Eq.(9.44) gives the expression 


d I 2 1 1 
VP = > (Vy x Vg) x vb-Vve—v- (Sv) x = [Vy x Ve] 
T Qn 2T hio r? 2T 
Lo LI' 1 eas . 
= | V- (>Wy || Vy. 
e Bruo E Y* 


(9.48) 
This gives the important result that VP must also be parallel to Vy which in turn implies 
that P = P(w) so that VP = P'Vy. Equation (9.48) now has a common vector factor 
Vw which may be divided out, so that the original vector equation reduces to the scalar 
equation 


Te cares už 
Vv. aVV + 4n? LP’ + gil =0. (9.49) 


This equation, known as the Grad-Shafranov equation (Grad and Rubin 1958, Shafranov 
1966), has the peculiarity that y shows up as both an independent variable and as a de- 
pendent variable, i.e., there are both derivatives of 4 and derivatives with respect to w. 
Axisymmetry has made it possible to transform a three-dimensional vector equation into a 
one-dimensional scalar equation. It is not surprising that axisymmetry would transform a 
three dimensional system into a two dimensional system, but the transformation of a three 
dimensional system into a one dimensional system suggests more profound physics is in- 
volved than just geometrical simplification. 
The Grad-Shafranov equation can be substituted back into Eq.(9.42) to give 


i= (2r P' + ferr) Vo (9.50) 
T 
so that the total current can be expressed as 
T' 
J= Jpoi + Stor = 5-VUx V+ (iR 4 PR) Vo = 2ar?P'Vo+I'B. (9.51) 
T T 


The last term is called the ‘force-free’ current because it is parallel to the magnetic field 
and so provides no force. The first term on the right hand side is the diamagnetic current. 
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The Grad-Shafranov equation is a non-linear equation in % and, in general, cannot 
be solved analytically. It does not in itself determine the equilibrium because it involves 
three independent quantities, Y, P(Y), and (a). Thus, use of the Grad-Shafranov equation 
involves specifying two of these functions and then solving for the third. Typically P(w) 
and (Y) are specified (either determined from other equations or from experimental data) 
and then the Grad-Shafranov equation is used to determine 7). 

Although the Grad-Shafranov equation must in general be solved numerically, there 
exist a limited number of analytic solutions. These can be used as idealized examples that 
demonstrate typical properties of axisymmetric equilibria. We shall now examine one such 
analytic solution, the Solov’ev solution (Solovev 1976). 

The Solov’ev solution is obtained by invoking two assumptions: first, the pressure is 
assumed to be a linear function of 4, 


P= Pyt Ay (9.52) 
and second, J is assumed to be constant within the plasma so 
T =0. (9.53) 


The second assumption corresponds to having all the z-directed current flowing on the 
z-axis, so that away from the z-axis, the toroidal field is a vacuum field (cf. Eq.(9.35)). 
This arrangement is equivalent to having a zero-radius current-carrying wire going up the 
z axis acting as the sole source for the toroidal magnetic field. The region over which the 
Solev’ev solution applies excludes the z—axis and so the problem is analogous to a central 
force problem where the source of the central force is a singularity at the origin. The 
excluded region of the z—axis corresponds to the ‘hole in the doughnut’ of a tokamak. In 
the more general case where T’ is finite, the plasma is either diamagnetic (toroidal field is 
weaker than the vacuum field) or paramagnetic (toroidal field is stronger than the vacuum 
field). 
Using these two simplifying assumptions the Grad-Shafranov equation reduces to 


Ə (100) OY yaa) 
rae G 4 t 5 HAT r ugo = 0 (9.54) 


which has the exact solution (Solov’ev solution) 


eel 
el to 


y(r, z) = Uo (2rg -r?° — 4a?2°) (9.55) 
where po, ro, and a are constants. Figure 9.11 shows a contour plot of w as a function 
of r/ro and z/zo for the case œ = 1. Note that there are three distinct types of curves in 
Fig.9.11, namely (i) open curves going to z = +00, (ii) concentric closed curves, and (iii) 
a single curve called the separatrix which separates the first two types of curves. 
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Figure 9.11: Contours of constant flux of Solov’ ev solution to Grad-Shafranov equation. 


Even though it is an idealization, the Solov’ ev solution illustrates many important fea- 
tures of three-dimensional, static MHD equilibria. Let us now examine some of these 
features. The constants in Eq.(9.55) have been chosen so that Y = Yo atr = ro, z=O0 
(in the figure this point corresponds to r = 1, z = O since lengths in the figure have been 
normalized to ro). The location r = ro, z = 0 is called the magnetic axis because it is the 
axis linked by the closed contours of poloidal flux. 

Let us temporarily assume that 79 is positive. Examination of Eq.(9.55) shows that 
if w is positive, then the quantity 2r2 — r? — 4a?z? is positive and vice versa. For any 
negative value of Y, Eq.(9.55) can be satisfied by making r very small and z very large. In 
particular, if r is infinitesimal then z must become infinite. Thus, all contours for which 7 
is negative go to z = +00; these contours are the open or type (i) contours. 

On the other hand, if 7 is positive then it has a maximum value of Yo which occurs on 
the magnetic axis and if 0 < Y < Wo then Y must be located at some point outside the 
magnetic axis, but inside the curve 2r2 — r? — 4a?z? = 0. Hence all contours of positive 
w must lie inside the curve 2r — r? — da?z? = 0 and correspond to field lines that do not 
go to infinity. These contours are the closed or type (ii) contours, since they form closed 
curves in the r, z plane. 

The contour separating the closed contours from the contours going to infinity is the 
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separatrix and is given by the ellipse 
r? + 4a?z? = Qre. (9.56) 


On the magnetic axis 7 is a local maximum if 79 is positive or a local minimum if Yo 
is negative (the sign depends on the sense of the toroidal current). Thus, one can imagine 
that a hill (or valley) of poloidal flux exists with apex (or bottom) at the magnetic axis 
and in the vicinity of the apex (bottom) the contours of constant ~ are circles or ellipses 
enclosing the magnetic axis. The degree of ellipticity of these surfaces is determined by 
the value of a. These closed flux surfaces are a set of nested toroidal surfaces sharing the 
same magnetic axis. The projection of the total magnetic field lies in a flux surface since 
Eq.(9.34) shows 

B-Vy~ =0, 
i.e., the magnetic field does not have a component in the direction normal to the flux surface. 

Direct substitution of Eq.(9.55) into Eq.(9.54) gives 


= 2 
A= ae (1 +7) (9.57) 
so that the pressure is 
Wo 2 
Pv) =Po+ aq (1 +a°)y. (9.58) 
(v) PO ae 


Pressure vanishes at the plasma edge so if Yeage is the poloidal flux at the edge, then the 
constant Po is determined and Eq.(9.58) becomes 


2y 
P(Y) = a (1+ 0) [vb — Veagel - (9.59) 


In the earlier discussion of single particle motion it was shown that particles were at- 
tached to flux surfaces. Particles that start on a particular flux surface always stay on that 
flux surface, although they may travel to any part of that flux surface. The equilibrium pre- 
scribed by Eq.(9.55) is essentially a three-dimensional vortex with pressure peaking on the 
magnetic axis and then falling monotonically to zero at the edge. 

This is a particularly simple solution to the Grad-Shafranov equation, but nevertheless 
demonstrates several important features: 

1. Three dimensional equilibria can incorporate closed nested poloidal flux surfaces 
which are concentric about a magnetic axis and correspond to local maxima or minima 
of the flux w. 


There can also be open flux surfaces; i.e., flux surfaces which go to infinity. 
The flux surface separating closed and open flux surfaces is called a separatrix. 


The total magnetic field projects into the flux surfaces since B - Vw = 0. 


E a 


Finite pressure corresponds to a depression (or elevation) of the poloidal flux. The 
pressure maximum and flux extrema are located at the magnetic axis. 


6. The poloidal magnetic field and associated poloidal flux ~ are responsible for plasma 
confinement and result from the toroidal current. Thus toroidal current is essential for 
confinement in an axisymmetric geometry. For a given P(w) the toroidal magnetic 


9.8 Static equilibria 285 


field does not contribute to confinement if J’ = 0. If J’ is finite, plasma diamagnetism 
or paramagnetism will affect P(w). Although the toroidal field does not directly pro- 
vide confinement, it can affect the rate of cross-field particle and energy diffusion and 
hence the functional form of P(w). For example, the frequency of drift waves will 
be a function of the toroidal field and drift waves can cause an outward diffusion of 
plasma across flux surfaces, thereby affecting P(w). 


7. The poloidal flux surfaces are related to the surfaces of constant canonical angular 
momentum since Bpa = V x (~V¢/27) = V x (vo/2nr) =V x Ago implies 


Ag = /2rr. Thus, the canonical angular momentum can be expressed as 


Po = mr2o+ qrAg 
qy 
= +=. 9.60 
mrs + > (9.60) 


Surfaces of constant canonical angular momentum correspond to surfaces of constant 
poloidal flux in the limit m — 0. Because the system is toroidally symmetric (axisym- 
metric) the canonical angular momentum of each particle is a constant of the motion 
and so, to the extent that the particles can be approximated as having zero mass, parti- 
cle trajectories are constrained to lie on surfaces of constant 4. Since pg is a conserved 
quantity, the maximum excursion a finite-mass particle can make from a poloidal flux 
surface can be estimated by writing 


pg = 6(mrug + qrAg) = 0. (9.61) 
Thus, 
Ug q.10 
ôr + dug + —4 Ay) =0 62 
or Ud Oo $) (9.62) 
or 
ô 
br = -—— t (9.63) 
Ug + TWe, pol 
where We pot = qBpoi/M is the cyclotron frequency measured using the poloidal 


magnetic field. Since vg is of the order of the thermal velocity or smaller, if the 
poloidal field is sufficiently strong that the poloidal Larmor radius ripo! ~ Vg /We,pol iS 
much smaller than the radius r, then the first term in the denominator may be dropped. 
Since dug is also of the order of the thermal velocity or smaller, the particle cannot 
deviate from its initial flux surface by more than a poloidal Larmor radius. Since 
poloidal field is produced by toroidal current, it is clear that particle confinement to 
axisymmetric flux surfaces requires the existence of a toroidal current. 

Tokamaks, reverse field pinches, spheromaks, and field reversed configurations are all 
magnetic confinement configurations having three dimensional axisymmetric equilibria 
similar to this Solov’ev solution and all involve a toroidal current to produce a set of nested, 
closed poloidal flux surfaces which link a magnetic axis. Stellarators are non-axisymmetric 
configurations which have poloidal flux surfaces without toroidal currents; the advantage 
of current-free operation is offset by the complexity of non-axisymmetry. 
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9.9 Dynamic equilibria: flows 


The basic mechanism driving MHD flows will first be discussed using the simplified as- 
sumptions of incompressibility and self-field only. This situation is relevant to simple arcs, 
basic concepts of magnetoplasmadynamic thrusters, and to currents driven in molten met- 
als. The more general situation where the plasma is compressible and where there are 
external, applied magnetic fields in addition to the self-field will then be addressed. 


9.9.1 Incompressible plasma with self-field only 


MHD-driven flows involve situations where V x(J x B) is non-zero. This means the MHD 
force J x B is non-conservative in which case its finite curl acts as torque or equivalently as 
a source for hydrodynamic vorticity. When the MHD force is non-conservative a closed line 
integral ¢ dl- J x B will be finite, whereas in contrast a closed line integral of a pressure 
gradient $ dl-VP is always zero. Since a pressure gradient cannot balance the torque 
produced by a non-conservative J x B, it is necessary to include a vorticity-damping term 
(viscosity term) to allow for the possibility of balancing this torque. Plasma viscosity is 
mainly due to ion-ion collisions and is typically very small. With the addition of viscosity, 
the MHD equation of motion becomes 


oU 
p (F +U- vu) = J x B-VP+pV°U (9.64) 
where v is the kinematic viscosity. To focus attention on vorticity generation, transport, 
and decay the following simplifying assumptions are made: 


1. The motion is incompressible so that p = const. 


This is an excellent assumption for a molten metal, but is questionable for a plasma. 
However, this assumption could be at least locally reasonable for a plasma if the region 
of vorticity generation is smaller in scale than regions of compression or rarefaction 
or is spatially distinct from these regions. The consequences of compressibility will 
be discussed later. 


2. The system is cylindrically symmetric. 


A cylindrical coordinate system ,@,z can then be used and ¢ is ignorable. This 
geometry corresponds to magnetohydrodynamic thrusters and arcs. 


3. The flow velocities U and the current density J are in the poloidal direction (r and z 
plane). Restricting the current to be poloidal means that the magnetic field is purely 
toroidal (see Fig.9.10). 


The most general poloidal velocity for a constant-density, incompressible fluid has 
the form 


1 
U =— Vy x Vo (9.65) 
20 


where y(r, z) is the flux of fluid through a circle of radius r at location z. Note the analogy 
to the poloidal magnetic flux function used in Eqs.(3.146) and (9.34). Since U lies entirely 
in the r, z plane and since the system is axisymmetric, the curl of U will be in the ¢ 
direction. It is thus useful to define a scalar cylindrical “vorticity” Xx = ro -V x U; this 
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definition differs slightly from the conventional definition of vorticity (curl of velocity) 
because it includes an extra factor of r. This slight change of definition is essentially a 
matter of semantics, but makes the algebra more transparent. 

Since Vo = o/r it is seen that 


VxU= Pod ise U = Vo (9.66) 
T 
and so 
Vv x Cac x ve) =xV ġ. (9.67) 


Dotting Eq.(9.67) with Vo and using the vector identity Ve - V x Q = V- (QxV¢) 
gives 


1 ; xX 
V- (+ (Vy x Vo) x va) =A (9.68) 


or 
2 Eou 
r“V- mY = —2TX (9.69) 


which is a Poisson-like relation between the vorticity and the stream function. The vorticity 
plays the role of the source (charge-density) and the stream function plays the role of the 
potential function (electrostatic potential). However, the elliptic operator is not exactly a 
Laplacian and when expanded has the form 


1 Ə (1av\  &y 
20. ( may Vz 
Be (= ve) "Or E Or ) Oz? Ge 


which is just the operator in the Grad-Shafranov equation. 
Since the current was assumed to be purely poloidal, the magnetic field must be 
purely toroidal and so can be written as 


B=-1v¢ (9.71) 
21 


where J is the current through a circle of radius r at location z. This is consistent with the 
integral form of Ampere’s law, $ B-dl =i9/. The curl of Eq.(9.71) gives 


VxB= "oyy x Vo (9.72) 
T 


showing that J acts like a stream-function for the magnetic field. 
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I(r,z) = const. 


Y 


Figure 9.12: Current channel I(r, z). 


The magnetic force is therefore 


1 Ho Ho ( r ) 
J x B=— (VI x V¢) x —IV¢=- 5Vi—}). (9.73) 
2T ( $) 2T 4 (27r)? 
It is seen that there is an axial force if J? depends on z; this is the essential condition that 
produces axial flows in MHD arcs (Maecker 1955) and plasma guns(Marshall 1960). 
Using the identities VU?/2 = U-VU + Ux Vx U and V x V x U = —V°U =Vx x 
V the equation of motion Eq.(9.64) becomes 


ðU _U? Ho (5) 

— +V— -UxyxVo) = 5V VP VVxx V 9.74 
(Fi 5 x s) Qar) 5 puvx pe (9.74) 

or 

OU U? Ho I’ hol? 1 

—+V—-U V = 5V H zV 
o( Ot + 2 xX 6) (27)? (5) (27)? (z=) 

— VP -— pvVx x Vd. (9.75) 


Every term in this equation is either a gradient of a scalar or else can be expressed as a 
cross-product involving Vø. The equation can thus be regrouped as 


U? I2 Ou I? 
v(! asko Pl (20% pUy——H0 e+ px) xV6=0 


2 (2r)? 2r2 * Qn ôt (27r)? 
(9.76) 
Similarly, the MHD Ohm’s law E + U x B = ņJ can be written as 
OA 
VV +U x IV = -LVI x Vo (9.77) 
Ot 20 20 


where V is the electrostatic potential. Since B is purely toroidal, A is poloidal and so is 
orthogonal to V@. Thus, both the equation of motion and Ohm’s law are equations of the 
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form 

Vgt+QxVo=0. (9.78) 
which is the most general form of an axisymmetric partial differential equation involving a 
potential. Two distinct scalar partial differential equations can be extracted from Eq.(9.78) 
by (i) operating with Vo - Vx and (ii) taking the divergence. Doing the former it is seen 
that Eq.(9.78) becomes 


Vo- V x (QxV¢) =0 (9.79) 
or 


1 
y. (ara) 25 (9.80) 
Applying this procedure to Eq.(9.76) gives 


po LG 1 lol a : 1 = 
oe a (v Zve) pv (z0x) V (eo Z|+pvV 7a VX = 0 (9.81) 


or 


d(x x lig OI? 1 
= (4) + V-(U4)= bu: (Vx). 9.82 
Ot (4) r2 4r?prt Oz ý re X Oe) 
Since V - U = 0, this can also be written as 
d(x X lig OF? 1 
Ea xa U-V ( ) = FVV- =V ; 
at (3) = r2 d4n2pr* Oz eae ke (2:83) 


which shows that if there is no viscosity and if 3I? /0z = 0, then the scaled vorticity x/r? 
convects with the fluid; i.e., is frozen into the fluid. The viscous term on the right hand 
side describes a diffusive-like dissipation of vorticity. The remaining term r~ +01? /ðz acts 
as a vorticity source and is finite only if I? is non-uniform in the z direction. The vortic- 
ity source has a strong r~* weighting factor so that axial non-uniformities of J near r = 0 
dominate. Positive x corresponds to a clockwise rotation in the r, z plane. If I? is an in- 
creasing function of z then the source term is negative, implying that a counterclockwise 
vortex is generated and vice versa. Suppose as shown in Fig.9.12 that the current channel 
becomes wider with increasing z corresponding to a fanning out of the current with in- 
creasing z. In this case ôT? /0z will be negative and a clockwise vortex will be generated. 
Fluid will flow radially inwards at small z, then flow vertically upwards, and finally radi- 
ally outwards at large z. The fluid flow produced by the vorticity source will convect the 
vorticity along with the flow until it is dissipated by viscosity. 


Operating on Ohm’s law, Eq.(9.77), with Vo - Vx gives 


aer) (he) 


showing that 7/r? is similarly convected with the fluid and also has a diffusive-like term, 
this time with coefficient n/ pọ. 
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Thus, the system of equations can be summarized as (Bellan 1992) 


ð X X oa 1 Ho Or? 
Ot (3) T uv (5) raw 7a VX dn prt Az (9.85) 
o/I I n 1 
a (5) ip (= ) (9.86) 
1 
ry (ave) Fe (9.87) 
1 
U = ~VYxVe . (9.88) 
2T 


This system cannot be solved analytically, but its general behavior can be described in a 
qualitative manner. The system involves three scalar variables, x, Y, and I all of which are 
functions of r and z. Boundary conditions must be specified in order to have a well-posed 
problem. If a recirculating flow is driven, the fluid flux ~ will be zero on the boundary 
whereas 7 will be finite on electrodes at the boundary. In particular, the current J will 
typically flow from the anode into the plasma and then from the plasma into the cathode. 
The vorticity x will usually have the boundary condition of vanishing on the bounding 
surface. 


Initially, there is no flow so ~ is zero everywhere. An initial solution of Eq.(9.86) in 
this no-flow situation will establish a current profile between the two electrodes. For any 
reasonable situation, this I(r, z) profile will have 0J?/0z 4 0 so that a vorticity source 
will be created. The source will be quite localized because of the r~* coefficient. Once 
vorticity x is created as determined by Eq.(9.85), the vorticity acts as a source term for the 
fluid flux w in Eq.(9.87), and so a finite y will be developed. Thus a flow U(r, z) will be 
created as specified by Eq.(9.88), and this flow will convect both x/r? and I/r?. 


A good analogy is to think of the r- +0? /0z term as constituting a toroidally sym- 
metric centrifugal pump which accelerates fluid radially from large to small r. This radial 
acceleration takes place at z locations where the current channel radius is constricted. The 
pump then accelerates the ingested fluid up or down the z axis, in a direction away from the 
current constriction. This vortex generation can also be seen by drawing vectors showing 
the magnitude and orientation of the J x B force in the vicinity of a current constriction. 
It is seen that the J x B force is non-conservative and provides a centrifugal pumping as 
described above. 


An arc or magnetoplasmadynamic thruster can thus be construed as a pump which 
sucks fluid radially inward towards the smaller radius electrode and then shoots the fluid 
axially away from the smaller electrode towards the larger radius electrode. The sign of the 
current does not matter, since the pumping action depends only on the z derivative of [?. If 
the equations are put in dimensionless form, it is seen that the characteristic flow velocity 
is of the order of the Alfvén velocity. 
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Once x,~w and I have been determined, it is possible to determine the pressure and 
electrostatic potential profiles. This is done by taking the divergence of Eq.(9.78) to obtain 


V7 =-Vb-VxQ. (9.89) 


The pressure and electrostatic potential are contained within the g term, whereas the Q 
term involves only x, % and J or functions of x, Y% and I (e.g., the velocity). Thus, the right 
hand side of Eq.(9.89) is known and so can be considered as the source for a Poisson-like 
equation for the left hand side. For the equation of motion 


pU? ig P 
Jmotion = 2 j (27)? 2r2 


+ P (9.90) 


so that 


pU? mw P 
2 (2r)? 2r? 


P= YImotion (9.91) 


Regions where gmotion is constant satisfy an extended form of the Bernoulli theorem, 
namely 


U? B? 
=n + Sig + P= const. (9.92) 
0 


Similarly, taking the divergence of Eq.(9.77) gives an equation for the electrostatic po- 
tential (using Coulomb gauge so that V- A = 0). Thus, a complete solution for incompress- 
ible flow is obtained by first solving for x, Y and J using prescribed boundary conditions, 
and then solving for the pressure and electrostatic potential. 


9.9.2 Compressible plasma and applied poloidal field 


The previous discussion assumed that the plasma was incompressible and the magnetic 
field was purely toroidal (i.e., generated by the prescribed poloidal current). The more 
general situation involves having a pre-existing poloidal magnetic field such as would be 
generated by external coils and also allowing for plasma compressibility. This situation 
will now be discussed qualitatively making reference to the sketch provided in Fig.9.13 (a 
more thorough discussion is provided in Bellan (2003)). 
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(a) (b) 
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Figure 9.13: (a) Current channel (r, z) which has same contours as poloidal flux surfaces 
W(r, z) so that VI is parallel to Vw in order to have no toroidal acceleration. The J x B 
force shown by thick arrows is larger at the bottom and canted giving both a higher on-axis 
pressure at the bottom and an axial upwards flow. (b) If the flow stagnates (slows down) for 
some reason, then mass accumulates at the top. The frozen-in convected toroidal flux also 
accumulates which thus increases By and so forces r to decrease since Ampere’s law gives 
2r Ber = I(r,z) which is constant on a surface of constant I(r, z). The upward-moving, 
stagnating plasma flow with embedded toroidal flux acts somewhat like a zipper for the 
surfaces of constant I(r, z) and W(r, z). 


Consider the initial situation shown in Fig.9.13(a) in which a plasma is immersed in 
an axisymmetric vacuum poloidal field y(r, z) with a poloidal current I(r, z). Because Y% 
is assumed to correspond to a vacuum field it is produced by external coils with toroidal 
currents. There are thus no toroidal currents in the volume under consideration (i.e., in 
the region described by Fig.9.13(a)) and so using Eq.(9.42) it is seen that y(r, z) satisfies 
V- (r=?Vy) = 0. The poloidal current I(r, z) has an associated poloidal current density 


1 
Jpol = 5 VI x Vo (9.93) 
2T 
and an associated toroidal magnetic field 
pol(r, 2) 
Bs= à 9.94 
$ = (9.94) 


The poloidal flux u(r, z) has an associated poloidal magnetic field 


1 
Bpo = = VY x Vo. (9.95) 
2T 


If Jo. is parallel to Bpo; then VI would be parallel to Vy so that constant I(r, z) 
would be coincident with constant ~(r, z) surfaces as indicated in the figure. On the other 
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hand if J,,.; were not parallel to Bpo, these surfaces would not be coincident and there 
would be a force Jo, X Bpo, in the toroidal direction would tend to cause a change in the 
toroidal velocity, i.e., an acceleration or deceleration of Uy. We argue that J;,.) x Bpoi must 
be a transient force with zero time average because any current which flows perpendicular 
to the poloidal flux surfaces must be the polarization current as given by Eq.(3.145). This 
polarization current results from the time derivative of the electric field perpendicular to 
the poloidal flux surfaces and this electric field in turn is proportional to the time rate of 
change of the poloidal current as determined from Faraday’s law. Thus, once the current 
is in steady state, it is necessary to have J = I(w) and so the surfaces of constant I(r, z) 
in Fig.9.12 can also be considered as surfaces of constant Y (r, z). Another way to see this, 
is to realize that a steady current perpendicular to flux surfaces would cause a continuous 
electrostatic charging of the flux surfaces which would then violate the stipulation that the 
plasma is quasi-neutral. 

The initial magnetic force is thus the same as for the situation of a purely toroidal 
magnetic field since 

J x B= Jior xX Bpot + Jpoi X Bror- (9.96) 
we 
Zzero 

The remaining force Jpo1 X Bror = (Jp? + J22) x Boo = J ,B2— J; Be? has components 
in both the r and z directions. If the plasma axial length is much larger than its radius (i.e., 
it is long and skinny) then the plasma will develop a local radial pressure balance 


(—VP + Jpoi X Bror) -Ê = 0. (9.97) 


However, this local radial pressure balance precludes the possibility of an axial pressure 
balance if ôy/ðz 4 O because, as discussed earlier, VP cannot balance J x B if the 
latter has a finite curl. Suppose that the radius of the current channel and the poloidal flux 
are both given by a = a(z) and since J = J(w) let us assume a simple linear dependence 
where [gl (r,z) = AW(r, z). Thus, VI is parallel to Vw and J is just the current per flux. 
We assume a parabolic poloidal flux 


(rz) _ ( A (9.98) 


Vo 
forr < a(z) and wo is the flux at r = a; this is the simplest allowed form for the flux (if 
w depended linearly on r, there would be infinite fields at r = 0). The local radial pressure 
balance is essentially a local version of the Bennett pinch relation 


OP 
— = -—J,B 
Or ? 
Y Ay? 
On substitution of Eq.(9.98) this becomes 
OP — maa fr \* 
ðr 8r?r? Or \a(z) 
pe E (9.100) 


27? uo at 
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Integration and using the boundary condition that P = 0 at r = a gives 


2,12 2 
a ees (1 5) (9.101) 


AT? jug a? a? 


which shows that the on-axis pressure is larger where a(z) is smaller as indicated in 
Fig.9.13(a). The axial component of the equation of motion is thus 


F, = 2- (-VP + Jpo X Bior) 
oP 
= = i758 
Oz ji i 
e NMEA A: oe? 1 p OBe 
An? uo Oz (a? at Lo Oe 


ay? 1 \ða 1 8 (xy? 
7 T? ho f 2a3 ` =) Oz 2u Oz (=) 
ee 1 r?\ 0a Wve ð /ry4 
E ( 2 =) ( ) 


Ta? Ho Oz = 8n?r? tg Oz \a 
Lola r? Oa 

= 1 .102 
270703 ( a?) Oz awa 


which shows that there is an axial force which peaks on axis. This force is proportional to 
current flowing along the flux tube and to the axial non-uniformity of the flux tube, i.e., to 
0a/Oz and accelerates plasma away from regions where a is small to regions where a is 
large. 

If the flow stagnates, i.e., is such that the axial velocity is non-uniform so that V - U is 
negative, then there will be a net inflow of matter into the stagnation region and hence an 
increase of mass density at the locations where V - U is negative. Because magnetic flux is 
frozen to the plasma, there will be a corresponding accumulation of toroidal magnetic flux 
at the locations where V-U is negative. If Ug is zero (as assumed on the basis of there being 
no steady toroidal acceleration and zero initial toroidal velocity), then the accumulation of 
toroidal flux will increase the toroidal field. This can be seen by considering the toroidal 
component of the induction equation 


OB. U, B 
PB miv (H) -rouv (B) Bua oa 


or 

—— = —B4V - Upo (9.104) 
where D/DT = 0/0t + U- V and Uy = 0 has been assumed. Thus, it is seen that By 
will increase in the frame of the plasma when V - Upo; is negative. However, Ampere’s 
law gives fig = 27rBg and since J = To at the outer radius of the current channel, it is 
seen that if Bg increases, then the radius of the current channel has to decrease in order to 
keep rBg fixed. The result is that stagnation of the flow tends to collimate the flux tube, 
i.e., make it axially uniform as shown in Fig.9.13(b). The plasma accelerated upwards in 
Fig.9.12 will then squeeze together the ~ and thus J surfaces so that these surfaces will 
be become vertical lines; very roughly stagnation of an upward moving plasma in Fig.9.12 
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can be imagined as a sort of “zipper” which collimates the flux surfaces. Collimation is 
often seen in current-carrying magnetic flux tubes (e.g., see Hansen and Bellan (2001) and 
is an important property of solar coronal loops (Klimchuk 2000) and of astrophysical jets 
(Livio 1999)). 


1. 


9.10 Assignments 


Show that if a Bennett pinch equilibrium has a J, independent of r, then the az- 
imuthal magnetic field is of the form Bg(r) = Be(a)r/a. Then show that if the 
temperature is uniform, the pressure will be of the form P = Po(1 — r?/a?) where 
Po = o1? /4r?a?. Show that this result is consistent with Eq.(9.29). 


Show that it is impossible to confine a spherically symmetric pressure profile using 
magnetic forces alone and therefore show that three dimensional magnetostatic equi- 
libria cannot be found for arbitrarily specified pressure profiles. To do this suppose 
there exists a magnetic field that satisfies J x B = V P where P = P(r) and r is the 
radius in spherical coordinates {r, 0, d}. 


(a) Show that neither J nor B can have a component in the fê direction. Hint: assume 
that B, is finite and write out the three components of J x B = VP in spher- 
ical coordinates. By eliminating either J, or B, from the equations resulting 
from the 0 and ¢ components, obtain equations of the form J,0P/dr = 0 and 
B,OP/Or = 0. Then use the fact that OP/Or is finite to develop a contradiction 
regarding the original assumption regarding J, or By. 


(b 


wm 


By calculating Jọ and J, from Ampere’s law (use the curl in spherical coordi- 
nates as given on p.523), show that J x B = VP implies B3 +B = B?(r). Ar- 
gue that this means that the magnitude of B =Bob + Boo must be independent 
of 0 and ¢ and therefore Bg and By must be of the form Bg = B sin [n(0, ,r)] 
and By = B cos [n(0,¢,1r)] where 7 is some arbitrary function. 


(c) Using J, = 0, B, = 0 and V- B = 0 (see p. 523 for the latter) derive a pair 
of coupled equations for 07/00 and 07/0¢. Solve these coupled equations for 
0n/00 and On /O¢@ using the method of determinants (the solutions for 07/00 
and 07 /0¢ should be very simple). Integrate the 07 /3¢ equation to obtain n and 
show that this solution for 77 is inconsistent with the requirements of the solution 
for 07/00, thereby demonstrating that the original assumption of existence of a 
spherically symmetric equilibrium must be incorrect. 


3. Using numerical methods solve for the motion of charged particles in the Solov’ev 


field, B = Vw x Vo where y = Bor?(2a? — r? — 4(az)*)/a* where Bo, a and 
q are constants. Plot the surfaces of constant % and show there are open and closed 
surfaces. Select appropriate characteristic times, lengths, and velocities and choose 
appropriate time steps, initial conditions, and graph windows. Plot the x,y plane and 
the x, z plane. What interesting features are observed in the orbits. How do they relate 
to the flux surfaces ~ = const.What can be said about flux conservation? 


4. Grad-Shafranov equation for a current-carrying flux tube:The Grad- 
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Shafranov equation describes axisymmetric equilibria and so can be used to charac- 
terize an axisymmetric magnetic flux tube. Suppose that Y = w(r, z) and that ~ is a 
monotonically increasing function of r. Assume that both the current J and pressure 
P are linear functions of ~ so that 


Mol = AV 
P Po(1 — Y/vo) 


where po is the flux surface at which P vanishes and Pp is the pressure on the z axis. 
Show that the Grad-Shafranov equation can be written in the form 


Leg 2-  4n?r?uoPo 
ey. (ave) hoa an 
Vo 
where 4% = y(r, z)/Wo- Show that if 


p = W 
An? hoa? 
where a is the radius at which the pressure vanishes at z = 0, the flux tube must 
be axially uniform, i.e., ~ cannot have any z dependence. Express this condition in 
terms of the pitch angle of the magnetic field B,/B, measured at r = a. Hint: note 
that the Grad-Shafranov equation has a both homogeneous part (left hand side) and 
inhomogeneous part (right hand side). Show that ~ = r?/a? is a particular solution 
and then consider the form of the general solution (homogenous plus inhomogeneous 
solution). 


Lawson Criterion: Of the many possible thermonuclear reactions, the deuterium- 
tritium (DT) reaction stands out as being most attractive because it has the largest 
reaction cross-section at accessible temperatures. The DT reaction has the form 


D+T n +He* + 17.6 MeV. 


The output energy consists of 3.5 MeV neutron kinetic energy and 14.1 MeV alpha 
particle kinetic energy. Fusion reaction cross-sections have a very strong dependence 
on the impact energy of the reacting ions because of the Coulomb barrier. In order for 
a controlled fusion reaction to be economically useful, more energy must be generated 
than is invested. The break-even condition is determined by equating the energy input 
per volume to the energy output for the reaction duration. The time duration that this 
condition is maintained is called the energy confinement time T p of the configuration, 
a measure of the insulating capability of the plasma confinement device. Let all tem- 
peratures and energies be measured in kilovolts (keV) so that the Boltzmann constant 
is x = 1.6 x 10716 and consider a volume V of reacting deuterium, tritium and asso- 
ciated neutralizing electrons (the plasma needs to be overall neutral, otherwise there 
will be enormous electrostatic forces). 


(a) Show that if the electrons and ions have the same temperature T the energy 
required to heat this volume is Ein ~ 3neVKT where ne is the electron density 
and equal densities of deuterium and tritium are assumed so that np = np = 


(b 


wm 


(c) 


(d 


wm 


(e) 


(f) 
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Ne /2.(Hint: the mean energy per degree of freedom is K7'/2 so the mean energy 
of a particle moving in three dimensional space is 34T/2). 


What is the fusion reaction rate R for a single incident ion of velocity v pass- 
ing through stationary target ions where o(v) is the fusion reaction rate cross- 
section and n; is the density of the target ions (assumed stationary). If < 
a(v)uv > denotes the velocity average of o(v)v what is the velocity-averaged 
reaction rate? 


Assume that the reaction of hot tritium with hot deuterium can be approximated 
as the sum of hot deuterium interacting with stationary tritium and hot tritium 
interacting with stationary deuterium. Show that the fusion output power in the 
volume is 

P=npnr < a(v)u > V Ereaction 
where Epeaction = 17.6 x 10° keV. 


Show that the ratio Q of the fusion output energy to the input energy satisfies the 


relation 
6.8 x 10-*Trev 


<o(v)v > 


NeTE~ Q 
At break-even Q = 1. 


For a thermal ion distribution and the experimentally measured reaction cross- 
section o (v) the velocity averaged rate integral has the following values 


Temperature in keV | < o(v)u > in m/s | neTg 
1 5.5 x 10777 
2 2.6 x 10775 
5 1.3 x 107?3 
10 1.1 x 10-7? 
20 4.2 x 1072 
50 8.7 x 10722 
100 8.5 x 10722 


Calculate the n.7 p required for break-even at the listed temperatures (i.e. fill in 
the last column). 


The Joint European Tokamak (JET) located in Culham, UK (in operation during 
the latter 20th and early 21st century) has a plasma volume of approximately 80 
cubic meters. Suppose JET has an energy confinement time of Tz ~ 0.3 s at 
ne ~ 107° m~? and T ~ 10 keV. If the plasma is a 50%-50% mixture of D-T 
how much fusion power would be produced in JET and what is Q? What is the 
weight of the plasma? How much does the confinement time have to be scaled 
up to achieve break-even? How much does the volume have to be scaled up to 
achieve 2000 MWt power output at breakeven? In order to achieve this volume 
scale-up, how much does the linear dimension have to be scaled up? 


10 


Stability of static MHD equilibria 


Solutions to Eq.(9.49), the Grad-Shafranov equation, (or to some more complicated coun- 
terpart in the case of non-axisymmetric geometry) provide a static MHD equilibrium. The 
question now arises whether the equilibrium is stable. This issue was forced upon early 
magnetic fusion researchers who found that plasma which was expected to be well-confined 
in a static MHD equilibrium configuration would instead became violently unstable and 
crash destructively into the wall in a few microseconds. 


The difference between stable and unstable equilibria is shown schematically in Fig.10.1. 
Here a ball, representing the plasma, is located at either the bottom of a valley or the top of 
a hill. If the ball is at the bottom of a valley, i.e., a minimum in the potential energy, then a 
slight lateral displacement results in a restoring force which pushes the ball back. The ball 
then overshoots and oscillates about the minimum with a constant amplitude because en- 
ergy is conserved. On the other hand, if the ball is initially located at the top of a hill, then 
a slight lateral displacement results in a force that pushes the ball further to the side so that 
there is an increase in the velocity. The perturbed force is not restoring, but rather the op- 
posite. The velocity is always in the direction of the original displacement; i.e., there is no 
oscillation in velocity. 


stable unstable 
equilibrium equilibrium 


Figure 10.1: Stable and unstable equilibria 
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The equation of motion for this system is 

d?r 

ae 

where « is assumed positive, the plus sign is chosen for the ball on hill case, and the minus 

sign is chosen for the ball in valley case. This equation has a solution x ~ exp(—iwt) 

where w = +,/K/m for the valley case and w = +i,/«/m for the hill case. The hill 

solution w = —i,/«/m is unstable and corresponds to the ball accelerating down the hill 
when perturbed from its initial equilibrium position. 

If this configuration is extended to two dimensions, then stability would require an 
absolute minimum in both directions. However, a saddle point potential would suffice for 
instability, since the ball could always roll down from the saddle point. Thus, a multi- 
dimensional system can only be stable if the equilibrium potential energy corresponds to 
an absolute minimum with respect to all possible displacements. 

The problem of determining MHD stability is analogous to having a ball on a multi- 
dimensional hill. If the potential energy of the system increases for any allowed perturba- 
tion of the system, then the system is stable. However, if there exists even a single allowed 
perturbation that decreases the system’s potential energy, then the system is unstable. 


= TK (10.1) 


10.1 The Rayleigh-Taylor instability of hydrodynamics 


An important subset of MHD instabilities is formally similar to the Rayleigh-Taylor in- 
stability of hydrodynamics; it is therefore useful to put aside MHD for the moment and 
examine this classical problem. As any toddler learns from stacking building blocks, it is 
possible to construct an equilibrium whereby a heavy object is supported by a light ob- 
ject, but such an equilibrium is unstable. The corresponding hydrodynamic situation has 
a heavy fluid supported by a light fluid as shown in Fig.10.2(a); this situation is unstable 
with respect to the rippling shown in Fig.10.2(b). The ripples are unstable because they 
effectively interchange volume elements of heavy fluid with equivalent volume elements 
of light fluid. Each volume element of interchanged heavy fluid originally had its center 
of mass a distance A above the interface while each volume element of interchanged light 
fluid originally had its center of mass a distance A below the interface. Since the potential 
energy of a mass m at height h in a gravitational field g is mgh the respective changes in 
potential energy of the heavy and light fluids are, 
dW, = —2p,V Ag, W, = +29,V Ag (10.2) 
where V is the volume of the interchanged fluid elements and p, and p, are the mass 
densities of the heavy and light fluids. The net change in the total system potential energy 
is 
OW = —2 (Pn — P) VAg (10.3) 
which is negative so that the system lowers its potential energy by forming ripples. This is 
analogous to the ball falling off the top of the hill. 

A well-known example of this instability is the situation of an inverted glass of wa- 
ter. The heavy fluid in this case is the water and the light fluid is the air. The system is 
stable when a piece of cardboard is located at the interface between the water and the air, 
but when the cardboard is removed the system becomes unstable and the water falls out. 
The function of the cardboard is to prevent ripple interchange from occurring. The sys- 
tem is in stable equilibrium when ripples are prevented because atmospheric pressure is 
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adequate to support the inverted water. From a mathematical point of view, the cardboard 
places a constraint on the system by imposing a boundary condition which prevents ripple 
formation. 

When the cardboard is removed so that there is no longer a constraint against ripple 
formation, the ripples grow from noise to large amplitude, and the water falls out. This is 
an example of an unstable equilibrium. 


(a) (b) 


y=h 
heavy fluid heavy fluid 7 

y=0 —.k 
light fluid light fluid wavenumber 
N X Y Y of perturbation 
equilibrium perturbation 


Figure 10.2: (a)Top-heavy fluid equilibrium, (b) rippling instability. 


The geometry shown in Fig.10.2 is now used to analyze the stability of a heavy fluid 
supported by a light fluid in the situation where no constraint exists at the interface. Here 
y corresponds to the vertical direction so that gravity is in the negative y direction. To 
simplify the analysis, it is assumed that p, << pp in which case the mass of the light fluid 
can be ignored. The water and air are assumed to be incompressible, i.e., pọ = const., so 
that the continuity equation 


O 
L +v-Vp+pV-v =0 (10.4) 
reduces to 
V-v=0. (10.5) 
The linearized continuity equation in the water therefore reduces to 
Op 
a + vı: Vpo = 0 (10.6) 


and the linearized equation of motion in the water is 


Ovi 

Po Ae 

The location y = O is defined to be at the unperturbed air-water interface and the top of the 

glass is at y = h. The water fills the glass to the top and thus is constrained from moving 
at the top, giving the top boundary condition 


= -VP, — pig. (10.7) 


Vy = Oat y =h. (10.8) 
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The perturbation is assumed to have the form 


vı = vı (y)e ti x (10.9) 


where k lies in the x — z plane and positive y implies instability. The incompressibility 
condition, Eq.(10.5), can thus be written as 


Ov, y 
Oy 


where | means perpendicular to the y direction. The y and | components of Eq.(10.7) 
become respectively 


+ik-v,, =0 (10.10) 


aP, 
WPorty =~ Ber — Prd (10.11) 
VpoVi. = —ikP,. (10.12) 


The system is solved by dotting Eq.(10.12) with ik and then using Eq. (10.10) to eliminate 
k - vı | and so obtain 


ð l 
-yo 5 = KP. (10.13) 
The perturbed density, as given by Eq.(10.6), is 
Apo 
YP1 = Huay (10.14) 
Next, p, and P, are substituted for in Eq. (10.11) to obtain the eigenvalue equation 
ð | ə Wy 2 Apo) ,2 
— = — gg | kfviy. 10.15 
ay h Po dy Y Po — 9 By Vly ( ) 


This equation is solved by considering the interior and the interface separately: 
1. Interior: Here 0p) /Oy = 0 and py = const. in which case Eq.(10.15) becomes 


= k?viy . (10.16) 
The solution satisfying the boundary condition given by Eq.(10.8) is 
vıy = Asinh(k(y — h)). (10.17) 


2. Interface: To find the properties of this region, Eq.(10.15) is integrated across the 
interface from y = O_ toy = 0, to obtain 


2 Viy re 2 0+ 
Y Po = — [9pok*v1y] o (10.18) 
Y lo 
or ə 
UL 
or = —gk* ny (10.19) 


where all quantities refer to the upper (water) side of the interface, since by assumption 
poly = 0_) =0. 
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Substitution of Eq.(10.17) into Eq.(10.19) gives the dispersion relation 


y? = kgtanh(k h) (10.20) 


which shows that the configuration is always unstable since y? > 0. Equation (10.20) 
furthermore shows that short wavelengths are most unstable, but a more detailed analysis 
taking into account surface tension (which is stronger for shorter wavelengths) would show 
that y? has a maximum at some wavelength. Above this most unstable wavelength, surface 
tension would decrease the growth rate. 


10.2 MHD Rayleigh-Taylor instability 


We define a ‘magnetofluid’ as a fluid which satisfies the MHD equations. A given plasma 
may or may not behave as a magnetofluid, depending on the validity of the MHD approx- 
imation for the circumstances of the given plasma. The magnetofluid concept provides 
a legalism which allows consideration of the implications of MHD without necessarily 
accepting that these implications are relevant to a specific actual plasma. In effect, the 
magnetofluid concept can be considered as a tentative model of plasma. 

Let us now replace the water in the Rayleigh-Taylor instability by magnetofluid. We 
further suppose that instead of atmospheric pressure supporting the magnetofluid, a vertical 
magnetic field gradient balances the downwards gravitational force, i.e., at each y the up- 
ward force of — V B? /2 o supports the downward force of the weight of the plasma above. 
Although gravity is normally unimportant in actual plasmas, the gravitational model is nev- 
ertheless quite useful for characterizing situations of practical interest, because gravity can 
be considered as a proxy for the actual forces which typically have a more complex struc- 
ture. An important example is the centrifugal force associated with thermal particle motion 
along curved magnetic field lines acting like a gravitational force in the direction of the ra- 
dius of curvature of these field lines. The curved field with associated centrifugal force due 
to parallel thermal motion is replaced by a Cartesian geometry model having straight field 
lines and, perpendicular to the field lines, a gravitational force is invoked to represent the 
effect of the centrifugal force. The y direction corresponds to the direction of the radius of 
curvature. 

In order for — V B? to point upwards in the y direction, the magnetic field must depend 
on y such that its magnitude decreases with increasing y. Furthermore it is required that 
B, = 0 so that VB? is perpendicular to the magnetic field and the field can be considered 
as locally straight (field line curvature has already been taken into account by introducing 
the fictitious gravity). Thus, the equilibrium magnetic field is assumed to be of the general 
form 


Bo = Baol) + Boly)2. (10.21) 
The unit vector associated with the equilibrium field is 
A Broly) + Bzoly)Z 


(10.22) 


9 Baol)? + Baou)? 


For the special case where B,o(y) and Bo(y) are proportional to each other, the field line 
direction is independent of y, but in the more general case where this is not so, Bo depends 
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on y and thus rotates as a function of y. In this latter case, the magnetic field lines are said 
to be sheared, since adjacent y-layers of field lines are not parallel to each other. 

The magnetofluid is assumed incompressible and so using Eq.(9.10) the linearized 
equation of motion becomes 
Bo - VB, +B,- VBo 


Ovi -= 
=-VP,4 199 10.23 
Po aL 1 E Pigy ( ) 


where Ben 
Pı = Pı + e 
0 
is the perturbation of the combined hydrodynamic and magnetic pressure, i.e., the perturba- 
tion of P + B?/2j19. It is again assumed that all quantities vary in the manner of Eq.(10.9) 
so Eq.(10.23) has the respective y and L components 


ðP, i(k-Bo)B 
YWod1y = ay | l = Y pg (10.24) 


mE 1 OBo 
YPoVıL Da r i(k- Bo) Bit + By By (10.25) 


In analogy to the glass of water problem. Eq.(10.25) is dotted with ik and Eq.(10.10) is 
invoked to obtain 


Wy 25 1 ; ð (k - Bo) 
— = kP, + — |—(k-Bo)k-B Biy, ———— | . 10.2 
YPo Oy ee ( 0) 1. +iBiy Dy (10.26) 
Because V - Bı = 0, the perturbed perpendicular field is 
i OBy 
ik- Bı, =— (10.27) 
Oy 
so that Eq.(10.26) can be recast as 
= vı 1 k OB, $ O (k : Bo) 
k? P, = y k-B 2 4 iB; . 10.28 
LIPO Bae aie | i(k- Bo) a T (10.28) 


Following a procedure analogous to that used in the inverted glass of water problem, P, is 
eliminated in Eq.(10.24) by substitution of Eq. (10.28) to obtain 


1 ð du, 1f, OB, ð (ik - Bo) 
= k-B HB 
YPoUly k2 Oy { YPo oy ho | (i o) Oy ly Oy 
i(k- Bo) Bry y 
Ho ! 


(10.29) 
To proceed further, it is necessary to know Bıy. The complete vector B; is found by first 
linearizing the MHD Ohm’s law to obtain 


Eı + vi x Bo = 0, (10.30) 
then taking the curl, and finally using Faraday’s law to obtain 


7B, = V x [vı x Bol. (10.31) 
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The y component is found by dotting with ĝ and then using the vector identity V-(F x G) = 
G-Vx F-F-V x Gto obtain 


By = Ñ- V x [vi x Bo] = V- [(vi x Bo) x ĝ] = V - [vtyBo] = ik - Bovıy. (10.32) 


Substituting into Eq.(10.29) using Eq.(10.32) and Eq.(10.14) and rearranging the order 
gives 


ð 2 1 | Wy l 2) ,2 ðpo , (k- Bo)” 
za +—(k-B —Y\=k g H vıy (10.33) 
By { h Po eBo) a YPo— 95, ie ly 


which is identical to the inverted glass of water problem if k - Bo = 0. 

Rather than have an abrupt interface between a heavy and light fluid as in the glass 
of water problem, it is assumed that the magnetofluid fills the container between y = 0 
and y = h and that there is a density gradient in the y direction. This situation is more 
appropriate for a plasma which would typically have a continuous density gradient. It is 
assumed that rigid boundaries exist at both y = 0 and y = h so that vıy = O at both 
y = Oand y = h. These boundary conditions mean that rippling is not allowed at y = 0, h 
but could occur in the interior, 0 < y < h. Equation (10.33) is now impossible to solve 
analytically because all the coefficients are functions of y. However, an approximate 
understanding for the behavior predicted by this equation can be found by multiplying the 
equation by vıy and integrating from y = 0 to y = h to obtain 


1 avi, ]" i 1 Av1,\? 
2 2 ly 2 2 ly 
yY Po + — (k - Bo bor | -J [e + — (k - Bo (3) dy 
f £ Th ) ” Oy 0 o £ a oy 
h 
f O k-Bo ; 
=” f fano g Po 4 EBO | 2 ay, 
0 Ho 


Oy 
(10.34) 
The integrated term vanishes because of the boundary conditions (which could also have 
been vı; /Əy = 0). Solving for 7? gives 


(10.35) 


If k - Bo = 0 and the density gradient is positive everywhere then 7? > 0 so there 
is instability. If the density gradient is negative everywhere except at one stratum with 
thickness Ay, then the system will be unstable with respect to an interchange at that one 
‘top-heavy’ stratum. The velocity will be concentrated at this unstable stratum and so 
the integrands will vanish everywhere except at the unstable stratum giving a growth rate 
yY ~ gAypo 1 Op, /3y where po/Əy is the value in the unstable region. This MHD 
version of the Rayleigh-Taylor instability is called the Kruskal-Schwarzschild instability 
(Kruskal and Schwarzschild 1954). 
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Finite k - Bo opposes the effect of the destabilizing positive density gradient, reducing 
the growth rate to 7? ~ g Ay po! Opo/Oy — (k- Bo)” /Lo. A sufficiently strong field 
will stabilize the system. However, thermally excited noise excites modes with all possible 
values of k and those modes aligned such that k - Bo = 0 will not be stabilized. A shear in 
the magnetic field has the effect of making it possible to have k - Bo(y) = Oat only a single 
value of y. The lack of any spatial dependence of k results from the translational invariance 
of the plasma with respect to both x and z implying that k =k, + k,Z is independent of 
position. Thus, magnetic shear constrains the instability to a narrow y stratum. 

The stabilizing effect of finite k- Bo can be understood by considering Eq.(10.32) 
which shows that the amount of Bıy associated with a given vıy is proportional to k - Bo. 
Since the original equilibrium field had no y component, introducing a finite Bıy corre- 
sponds to ‘plucking’ the equilibrium field. The plucking varies sinusoidally along the equi- 
librium field and stretches the equilibrium field like a plucked violin string. The plucked 
field has a restoring force which pulls the field back to its equilibrium position. If the energy 
associated with plucking the magnetic field exceeds the energy liberated by the interchange 
of heavy upper magnetofluid with light lower magnetofluid, the mode is stable. 


last last 
flux flux 
saree surface 
ie ‘plasma 
vacuum vacuum 
‘good’ curvature ‘bad’ curvature 


Figure 10.3: Good and bad curvature of the plasma-vacuum interface. 


As discussed earlier, the gravitational force in the magnetofluid model represents the 
centrifugal force resulting from guiding center motion along curved field lines. This leads 
to the concept of ‘good’ and ‘bad’ curvature illustrated in Fig.10.3. A plasma has bad 
curvature if the field lines at the plasma-vacuum boundary have a convex shape as seen 
by an observer outside the plasma since in this case the centrifugal force is outwards from 
the plasma. Bad curvature gives a centrifugal force that can drive interchange instabilities 
whereas good curvature corresponds to a concave shape so that the centrifugal force is 
always inwards. Mirror magnetic fields as sketched in Fig.10.4 have good curvature in 
the vicinity of the mirrors and bad curvature in the vicinity of the mirror minimum so that 
a detailed analysis of interchange instabilities requires averaging the ‘goodness/badness’ 
along the portion of the flux tube experienced by the particle. Cusp magnetic fields (cf. 
Fig. 10.4) have good curvature everywhere, but have singular behavior at the cusps. Plasmas 
with internal currents such as tokamaks have significant shear everywhere since 0B, /Oy ~ 
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J, and B, /ðy ~ Jz; this shear inhibits interchange instabilities. However, as will be 
shown later this current can be the source for another class of instability called a current- 
driven instability. 


bad curvature 


NA good curvature good curvature 
a LAE 
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mirror field cusp field 


Figure 10.4: Mirror fields have bad curvature at the mirror minimum, good curvature near 
the maximum; cusp fields have good curvature everywhere. 


10.3 The MHD energy principle 


Suppose a system initially in equilibrium is subject to a small perturbation instigated by 
random thermal noise. The perturbation will affect all the various dependent variables in 
a way which must be consistent with all the relevant equations and boundary conditions. 
Thus, the perturbation may be considered as a low-level excitation of some allowed normal 
mode of the system. The mode will be unstable if it reduces the potential energy of the sys- 
tem and, if unstable, the mode growth rate will be proportional to the amount by which the 
potential energy is reduced. In order to investigate whether a given system is stable, it suf- 
fices to show that no modes exist which reduce the system potential energy. Demonstrating 
MHD stability this way was first done by Bernstein, Frieman, Kruskal and Kuslrud (1958) 
and the method is called the MHD energy principle. 

Each mode has its own specific pattern for displacing the magnetofluid volume elements 
from their equilibrium positions. The pattern of displacements can be represented by a 
vector function of position (x) that prescribes how a fluid volume element originally 
at location x is displaced. Thus, the mode involves moving a fluid element initially at 
x to the position x + (x). The displacement of a magnetofluid element initially at the 
surface is shown in Fig. 10.5, a sketch of a two-dimensional cut of a three dimensional 
magnetofluid (plasma) surrounded by a vacuum region in turn bounded by a conducting 
wall. The wall could be brought right up to the magnetofluid to eliminate the vacuum 
region or, alternatively, the wall could be placed at infinity to represent a system having no 
wall and surrounded by vacuum. 
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perturbed perfectly 
surface conducting 
wall 


Figure 10.5: Two-dimensional cut of three-dimensional plasma equilibrium and perturba- 
tion; Gaussian pillbox is used to relate quantities in vacuum to quantities in plasma. Each 
volume element at a position x is displaced by an amount €(x). The displacement of a 
volume element at the surface is illustrated. 


10.3.1 Energy equation for a magnetofluid 


The energy content of a magnetofluid can be obtained from the ideal MHD equations if it is 
assumed that all motions are sufficiently fast to be adiabatic but slow enough for collisions 
to keep the pressure isotropic. The ideal MHD equations are 


p(T +U VU) =1xB-VP (10.36) 
E+ Ux B=0 (10.37) 
OB 
VxE=-—— 10. 
x AE (10.38) 


Vx B =u,J (10.39) 
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ð 
P |Y. (pU)=0 (10.40) 
Ot 
P~ ø. (10.41) 
Ohm’s law, Eq.(10.37), and Faraday’s law, Eq. (10.38), are combined to give the induction 


equation 

OB 
which prescribes the magnetic field evolution and, as discussed in the context of Eq.(2.82), 
shows that the magnetic flux is frozen into the magnetofluid. Equation (10.41), the adi- 
abatic relation, implies the following relationships for spatial and temporal derivatives of 
density and pressure 


(10.43) 


Combining these relationships with the continuity equation, Eq.(10.40), gives the pressure 
evolution equation 


OP 
BE +U-VP+yPV-U=0 (10.44) 
which can also be written as 
OP 
met V-(PU) +(7-1) PV-U=0. (10.45) 


An expression for the overall energy can be derived by first writing the equation of motion, 
Eq.(10.36), as 


OU 
PF + pV 5 pUxVxU=JIxB-VP (10.46) 
and then dotting Eq.(10.46) with U to obtain 
ð / V? U? 
pa (4) +00-v (F )=IxB-U-UvP. (10.47) 


Multiplying the entire continuity equation by U?/2 and adding the result gives 


2 2 
x (Æ) ! (A u) = J- UxB- V. (PU)+PV-U. (IDAS) 


Using Ampere’s law, Eq.(10.39), to eliminate J, then Ohm’s law, Eq.(10.37), to eliminate 
U x B, and the pressure evolution equation, Eq.(10.45), to eliminate PV - U, this energy 
equation becomes 


Of pu? pU? (V x B) 1 OP 
EV- U)= -E— V-(PU + V-(PU 
ree G Lo l GD (a m 
(10.49) 
Finally, using the vector identity 
V-(ExB) = B-VxE-E.VxB 
= -—B. OB se 58 (10.50) 
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Eq.(10.49) can be written as 


Əə (pU2 P B U? ExB 
P> ' Eval pe yT pu) En As 
Ot \ 2 y-1  2upo 2 Lg y-1 


This is a conservation equation relating energy density and energy flux. The time deriva- 
tive operates on the magnetofluid energy density and the divergence operates on the energy 
flux. The energy density is comprised of kinetic energy density pU/2, thermal energy 
density P/(y — 1), and magnetic energy B?/2j1,. The energy flux is comprised of convec- 
tion of kinetic energy pU?U/2, the Poynting vector E x B / uo, and convection of thermal 
energy density yPU/(y — 1). 

The pressure and density both vanish at the magnetofluid surface, but the Poynting 
flux E x B can be finite. However, if the tangential electric field vanishes at the surface, 
then the Poynting flux normal to the surface will be zero. This situation occurs if the 
magnetofluid is bounded by a perfectly conducting wall with no vacuum region between the 
magnetofluid and the wall. On the other hand, if a vacuum region bounds the magnetofluid, 
then a tangential electric field can exist at the vacuum-magnetofluid interface and allow 
a Poynting flux normal to the surface. Energy could then flow back and forth between 
the magnetofluid and the vacuum region. For example, if the magnetofluid were to move 
towards the wall thereby reducing the volume of the vacuum region, any vacuum region 
magnetic field would be compressed and so raise the energy contained in this vacuum 
magnetic field. This flow of energy into the vacuum region would require a Poynting flux 
from the magnetofluid into the vacuum region. 

Let us now consider the energy properties of the vacuum region between the mag- 
netofluid and a perfectly conducting wall. The equations characterizing the vacuum region 
are Faraday’s law 


OB 
VxE= a (10.52) 
and Ampere’s law 
VxB=0. (10.53) 
Dotting Faraday’s law with B, Ampere’s law with E and subtracting gives 
ô (B? Ex B 
( J+v-( s ) =0 (10.54) 
Ot \ 2ko Ho 


which is just the limit of Eq.(10.51) for zero density and zero pressure. Thus, Eq. (10.51) 
characterizes not only the magnetofluid, but also the surrounding vacuum region. As men- 
tioned earlier, the Poynting flux is the means by which electromagnetic energy flows be- 
tween the magnetofluid and the vacuum region. 

If the vacuum region is bounded by a conducting wall, then the tangential electric field 
must vanish on the wall. The integral of the energy equation over the volume of both the 
magnetofluid and the surrounding vacuum region then becomes 


ə pU? P B? 
—/a =0 10.55 
Ot J s ( ae T ) re 


since on the wall ds - E x B = 0 where ds is a surface element of the wall. If the wall is 
brought right up to the magnetofluid so there is no vacuum region, then Eq.(10.55) will also 
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result with the additional stipulation that the normal component of the velocity vanishes at 
the wall (i.e., the wall is impermeable). 

A system consisting of a magnetofluid surrounded by a vacuum region enclosed by an 
impermeable perfectly conducting wall will therefore have its total internal energy con- 


served, that is 
U? P B B? 
i: dr (2 ) | / d?r— = const. (10.56) 
Vms 2 y=1 bo Vac Ho 


where Vm is the volume of the magnetofluid and Vyac is the volume of the vacuum region 
between the magnetofluid and the wall. 
This total system energy can be split into a kinetic energy term 


U? 
T= fi dir (10.57) 
and a potential energy term 
P B? 
W= Í d?r (4, + =) (10.58) 
Vv y-1 bo 
so that 
T+wW=€ (10.59) 


where the total energy € is a constant. Here V includes the volume of both the magnetofluid 
and any vacuum region between the magnetofluid and the wall. 
We will now consider a static equilibrium (i.e., an equilibrium with Up = 0) so that 


0= Jo x Bo = VP. (10.60) 
Dotting with either Bo or Jo shows that 
Bo-VPo=0, Jo-VPo=0 (10.61) 
so that V P is normal to the surface defined by the Jo and Bo vectors. 


10.3.2 Self-adjointness of potential energy as a consequence of energy integral 


From Eq.(10.57) it is seen that in static equilibrium Tọ = 0. This means that all the internal 
energy must be in the form of stored potential energy, i.e., 


Wo = £. (10.62) 


It is now supposed that thermal noise causes a small motion of order € to develop at 
each point in the magnetofluid. This means there is a first-order velocity 


Urse (10.63) 


The displacement of a fluid element is obtained by time integration to be 


t 
e= U, dt’; (10.64) 
0 
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because € is small the spatial dependence of U, can be ignored when evaluating the time 
integral, i.e., terms of order €- VU, are ignored since these are of order €?. 
The kinetic energy associated with the mode is 


U2 
éT = J dir Ot (10.65) 
which clearly is of order €?. Since total energy is conserved it is necessary to have 


ôT +6W =0 (10.66) 


leading to the important conclusion that the perturbed potential energy ôW must also be 
of order €?. 

The perturbed pressure and magnetic field can be found to first-order in € by integrating 
Eqs.(10.42) and (10.44) respectively to obtain 


Bı = V x (€ x Bo) (10.67) 


and 

Pi = —:-V P — yPoV -& (10.68) 
showing that the first-order pressure and first-order magnetic fields are linear functions of 
£. However, since it was shown above that dW scales as e?, only terms second-order in € 
can contribute to 6W, and so all first-order terms must average to zero when integrating 
over the volume. To specify that ôW depends only to second-order in £, we write 


W = ôW (£, £) (10.69) 


where the double argument means that ôW is a bilinear function, i.e., W (a£,bn) = 
abôW (£, n) for arbitrary £, n. The time derivative of 6W is thus 


OW = W (È, £) + OW(E, È). (10.70) 


Since ¿ is algebraically independent of £, Eq.(10.70) means that ôW is self-adjoint (i.e., 
ôW is invariant when its two arguments are interchanged). Self-adjointness is a direct 
consequence of the existence of an energy integral. 


10.3.3 Formal solution for perturbed potential energy 


The self-adjointness property can be exploited by explicitly calculating the time derivative 
of the perturbed potential energy. This is done using the linearized equation of motion 


O7é 
Poa =F, (10.71) 
where 
F, = Jo x Bi +J1 x Bo — VP (10.72) 


results from linear operations on € since B1, 4ọJ1 = V x Bı and R are all result from 
linear operations on £. 
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Multiplying Eq.(10.71) by the perturbed velocity € and integrating over the volume of 
the magnetofluid and vacuum region gives 


22 
[eros ($) = [ere Fe. (10.73) 


Although the integration includes the vacuum region, both the right and left hand sides of 
this equation vanish in the vacuum region because no density, current, or pressure exist 
there. 


The left hand side of Eq.(10.73) is just the time derivative of the kinetic energy ôT. 
Since ôT + ôW = 0, it is seen that 


ôW = - [ere -F,(€). (10.74) 
However, since ôW was shown to be self-adjoint, Eq.(10.74) can also be written as ôW = 


-f dré- Fi (é ). Combining these last two equations provides an integrable form for ow, 
namely 


68W 


lI 
| 
NI = 
ae 
aM 
w 
3 
wn 
ry 
= 


È+ fare Fi(é)) 


= — (frere) f (10.75) 


Performing the time integration gives the desired result, namely the change in system po- 
tential energy as a function of the fluid displacement is 


ôW = = [ere -F,(€). (10.76) 


Standard techniques of normal mode analysis can be invoked by assuming that the dis- 
placement has the form 


= Re (Ee) (10.77) 
so the equation of motion can be written as 
—w? pë = F; (£). (10.78) 


Multiplication by E and integrating over the volume gives 


-u frot = f rE mO, (10.79) 


In the earlier discussion of self-adjointness, it was noted that € is essentially an arbitrary 
function for a given &, i.e., any é could be obtained by choosing a suitable time dependence 
for €. Thus, € could be chosen to be proportional to € “ and so the right hand side of 
Eq.(10.79) is also self-adjoint. Equation (10.79) can therefore be recast as 


-u | arok =3 | fore -F,(é) + fore) (10.80) 
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2 


which shows that w? must be pure real. Negative w? corresponds to instability. 


Equation (10.78) can be considered as an eigenvalue problem where w? is the eigen- 
value and € is the eigenvector. As in the usual linear algebra sense, eigenvectors having 
different eigenvalues are orthogonal and so, in principle, a basis set of normalized orthog- 
onal eigenvectors {¢,,, } can be constructed where 


Fi(¢m) = —WmPSm- (10.81) 


Thus any arbitrary displacement can be expressed as a suitably weighted sum of eigenvec- 
tors, 


£ =X Omm (10.82) 


The orthogonality of the basis set gives the relation 
SEE) 25. f Erna =0ifmEn. (10.83) 


Instability will result if there exists some perturbation that makes 6W negative. Con- 
versely, if all possible perturbations result in positive ÂW for a given equilibrium, then the 
equilibrium is MHD stable. 


10.3.4 Evaluation of ôW 


Since F; consists of three terms involving Jo, Bı and P; respectively, )W can be decom- 
posed into three terms, 


W = ôW 3, + Wp, + ôWp,. (10.84) 
Using Eqs. (10.72), (10.67), (10.68) in Eq.(10.76) shows that these terms are 
1 
Wn = -3 | are -Jo x By, (10.85) 
1 
Wg, = —— | d’ré-(V x Bi) x Bo, (10.86) 
2Ho 
1 
ôWpi = > dré - VP}. (10.87) 


Although the above three right hand sides are in principle integrated over the volumes of 
both the magnetofluid and the vacuum regions, in fact, the integrands vanish in the vacuum 
region for all three cases and so the integration is effectively over the magnetofluid volume 
only. The second two integrals can be simplified using the vector identities V - (C x D) = 
D-V x C—C-V x Dand V- (fa) =a-Vf +fV- ato obtain 


1 


2Ho Vinf 
1 3 42 1 

= — d?r Bi +=— ds - Bı x (€ x Bo) (10.88) 

2ko Vinf 24o Smif 


ôWp, = d?r (£ x Bo) - V x By 
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and 
Wp, = a / d?ré-V jé -Vh +V. ¿] 
2 
Vmf 
= ; J ar |-e-V (€-VPo) +yPo(V- £] _ s / ds -€yPoV -€. 
Vmf Sms 


(10.89) 


Since both J x B and VP are perpendicular to B at the surface of the magnetofluid, 
the force acting on the magnetofluid at the surface is perpendicular to B and so the dis- 
placement € of the magnetofluid at the surface is perpendicular to B, i.e., at the surface 
€ = €, .The surface integral in Eq.(10.88) can be expanded 
as ds-By x (€x Bo) = 5— | ds- é; Bo: Bi (10.90) 
20 J Sin 2H0 J Sms 
since ds- Bo = O on the magnetofluid surface. This latter condition is true because the 
magnetic field was initially tangential to the magnetofluid surface (i.e., Jo x Bo = VP 
implies V Py is perpendicular to Bg) and must remain so since the field is frozen into the 
magnetofluid. 


Recombining and re-ordering these separate contributions gives 


1 2 B? 
ôW = A dr fanwege Do xB + VE-V)]} 
2 S Ving Ho 
1 
+77 J. ds-£, [Bo- Bı — toyPoV - £]. (10.91) 
Ho J Sms 


The substitution ds - € =ds - é; has been made for the pressure contribution to the surface 
term on the grounds that ds must be perpendicular to Bo. Further simplification is obtained 
by considering the dot product with Bo of the term in square brackets in Eq.(10.91), namely 
Bo: [Jo x Bi +V(E-VPo)] = -VP-Bi+Bo-V(E-VPo) 

-VP -V x (€ x Bo) + Bo- V (€- VP) 
= V-{VPp x (€ x Bo) + (€- VP) Bo} 
V - [€Bo- VPo 

= 0 (10.92) 
where Eq.(10.61) has been used to obtain the last line. Thus, €-[Jo x By + V (£ - VPo)] = 


£- [Jo x Bi + V (£ - VP)| since Eq.(10.92) shows that the factor in square brackets has 
no component parallel to the equilibrium magnetic field. 


The potential energy variation ôW is now decomposed into its magnetofluid volume 
and surface components, 


2 
2, BR Bil 
Wp. =; f dr O ae aera (10.93) 
y 


: Ho Ho 
mt —€, -Jo x Bi-€, - Vi (L VPo) 
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and 


1 
Ws: = — ds-€, (Bo: Bı — oy PoV - £). (10.94) 
210 Smf 


At this point it is useful to examine the parallel and perpendicular components of By. 
Finite Bıı corresponds to changing the curvature of field lines (twanging or plucking) 
whereas finite B4) corresponds to compressing or rarefying the density of field lines. The 
equilibrium force balance can be written as 


B2 i 
pV Py = FN lee +KBé (10.95) 


where x =o - VBo is related to the local curvature of the equilibrium magnetic field. 
Applying the vector identity V (C - D) = C-VD+D-VC+DxVxC+CxVxD 
gives 


k = — Bo x V x Bo, (10.96) 


a relation which will be used when evaluating By). From Eq.(10.67) it is seen that the 
parallel component of the perturbed magnetic field is 


By = Bo-V x (£, x Bo) 
= (£i x By)-V x Bot+V- |(€, x Bo) x Bo 
= -Bor — V- (B£) 


= -Boé -k en By(V-£,) — $4-v 20 


= -Bo Peps +V-£,]+ Poe -VPp. (10.97) 
0 


The term involving Jo in Eq.(10.93) can be expanded to give 


£i- JoxBı = £; -Jor x BoBiy +£; - JoBo x Bus 


B i 
(€;. 2° VB) as +E,- Jo) Bo x Bit. (10.98) 


Substituting for this term in Eq.(10.93), factoring out one power of Bı], and then sub- 
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stituting Eq.(10.97) gives 


; B2 
yP (VE)? + 2 


1 3 B2 Ho 
ôWrp = 5 far i| Bi 
2 +— —(€,-VR 


-£| - JojBo x Bi, —€, - Vi (€,-VP) 


0 

_ Bu B Hogi: VPo 

' u 1| B 
0 o 
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1 
-5f ds-€, (€, - VP). (10.99) 
Smf 


A new surface term has appeared because of an integration by parts; this new term is 
absorbed into the previous surface term and the fluid and surface terms are redefined by 
removing the primes; thus 


eee 3 f YoPo(V-é) + Bz, + Bg, -n+(V-€,)P 
2 Uo Vint —2uo (€ VP) (ELK) +E, x Bi: (10 Joy Bo 
(10.100) 
and 
1 
ôWs = Za. ds- {Bo - Bi — po (PoV -E +E, -VP)} 
Ho J Sms 
1 
= — ds-€, {Bo- Bi + uP} (10.101) 
2 Lo Smf 


where Bo - ds = 0 and Eq.(10.68) have been used to simplify the surface integral. The sur- 
face integral can be further re-arranged by considering the relationship between the vacuum 
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and magnetofluid fields at the perturbed surface. If the equilibrium force balance is inte- 
grated over the volume of the small Gaussian pillbox located at the perturbed magnetofluid 
surface in Fig.10.5, it is seen that 


vac 


B? 1 B? 
o= f dry - (GES 1-1 BB a ds P+ 
Vpillbox,ps 219 Ho S, ps 2 Ho magnetofluid 


(10.102) 
since ds - B =0 at the perturbed surface. The subscripts ps indicate that the volume and 
surface integrals are at the perturbed surface. 

Quantities evaluated at the perturbed surface are of the form 


fpertsfe = fo + fi +€-V fo; (10.103) 


i.e., both absolute and convective first-order terms must be included. Since the pillbox 
extent was arbitrary in Eq.(10.102), the integrand [P + B?/ 24o] pe must vanish 
at each point on the perturbed surface, giving the relation 


Bo - By B? Bo- B B? 
(A+) +€-V (r+>-) =( :) rev ( ) 
Lo magneto 2Lo magneto Lo ae 2Uo jae 


fluid fluid 
(10.104) 


magnetofluid 


which can be rewritten as 


(n +55) =¢ s[e] (=) 
1 + —— = or > —— s 
Ho magnetofluid 2Ho magnetofluid Ho vac 


(10.105) 
Thus the surface integral Eq.(10.101) can be rewritten as 


vac 
2 1 


1 B 
Ws =3/ ds-€, £,-V P+ 
Smf 


+—— ds-€, (Bo- B1),,,.- 
uate 2Ho Simf 2 


(10.106) 
The volume integral in Eq.(10.100) is over the magnetofluid volume and the direction of 
ds in Eq.(10.106) is outwards from the magnetofluid volume. The energy stored in the 
vacuum region between the magnetofluid and wall is 


1 


bWyae = — d?r B2, 
2M0 J Vac ; 
1 
= dr Biv -VX Auy 
2Ho J Vac 
1 
= — dr V È (Aw x Buy) 
2Ho J Vrac 
1 
=Z ds - (Ai x By) (10.107) 
2Ho J Syao 


where ds points out from the vacuum region and V x Bı, = 0 has been used when inte- 
grating by parts. At the conducting wall (which might be at infinity) the integrand vanishes 
since it contains the factor ds x Aj, which is proportional to the tangential electric field 
and the tangential electric field vanishes at a conductor. Thus, only the surface integral over 
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the magnetofluid-vacuum interface remains. Since an element of surface ds pointing out 
of the vacuum points into the magnetofluid, using ds to mean out of the magnetofluid (as 
before), this becomes 


OWrac = ds - (Ay, x By,) (10.108) 


20 I Sin f 
where Smp has been used instead of Syac to indicate that ds points out of the magnetofluid. 
The tangential component of the electric field must be continuous at the magnetofluid- 
vacuum interface. This field is not necessarily zero. The electric field inside the mag- 
netofluid is determined by Ohm’s law, Eq.(10.37). Thus the electric field on the mag- 
netofluid side of the magnetofluid-vacuum interface is 


Eip = —Vı X Bo (10.109) 
while on the vacuum side of this interface the electric field is simply 
OA. 

Ot 


where Aj, is the vacuum vector potential. Defining the surface normal unit vector ù and 
integrating both electric fields with respect to time, the condition that the tangential electric 
field is continuous is seen to be 


Eyes (10.110) 


Ax Ay = fx (È x Bo) = —f-€By (10.111) 


where 7-Bo = 0 has been used. Thus, the second surface integral in Eq.(10.106) can be 
written as 


1 1 
s— | dsnr-€,By- Bru. =—s— | dsnxAty - Bry = OWyac. (10.112) 
2o Js 2o Js 


These results are now summarized. The perturbed potential energy can be expressed 
as 


OW = Wp + Wint + OWoac (10.113) 
where the contribution from the fluid volume interior is 
2 ‘ ‘ 2 
eo See a l YhoPo (V -E)?+B?, +B 2E, -w+(V-€)) ) 
2Ho J Ving +€, x By, - Botto Joy — 20 (£1: r) (€,-VP) 
(10.114) 


the contribution from the vacuum-magnetofluid interface is 


2 vac 
OWint = —— ds- -V jor + Z] (10.115) 
2 OS Smf 2 magnetofluid 
and the contribution from the vacuum region is 
OWoac = d?r B?,. (10.116) 


2g vac 
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10.4 Discussion of the energy principle 


The terms in the integrands of Wp ,ôW;int, and OWyac are of two types: those that are 
positive-definite and those that are not. Positive-definite terms always increase 6W and 
are therefore stabilizing whereas terms that could be negative are potentially destabilizing. 
Consideration of 6Wr in particular shows that magnetic perturbations interior to the mag- 
netofluid and perpendicular to the equilibrium field are always stabilizing, since these per- 
turbations appear in the form Bais It is also seen that a positive-definite term ~ (V - € y? 
exists indicating that a compressible magnetofluid is always more stable than an otherwise 
identical incompressible magnetofluid. Hence, incompressible instabilities are more vio- 
lent than compressible ones. It is also seen that two types of destabilizing terms exist. One 
gives instability if 

(K-€,)(€)-VPo) > 0; (10.117) 
this is a generalization of the good curvature/bad curvature result obtained in the ear- 
lier Rayleigh-Taylor analysis. In the vicinity of the magnetofluid surface all quantities 
in Eq.(10.117) point in the same direction and so Eq.(10.117) can be written as 


KVP >O => instability (10.118) 


which gives instability for bad curvature (« parallel to V P») and stability for good curva- 
ture (« antiparallel to VP»). A Bennett pinch has bad curvature and is therefore grossly 
unstable to Rayleigh-Taylor interchange modes. Instabilities associated with Eq.(10.117) 
are called pressure-driven instabilities, and are important in plasmas where there is signifi- 
cant energy stored in the pressure (high 3 where 3 = 2uo Po / B8). 

The other type of destabilizing term depends on the existence of a force-free current 
(i.e., Joy, # 0) and leads to internal kink instabilities. These sorts of instabilities are called 
current-driven instabilities (although strictly speaking, only the parallel component of the 
current is involved). 

There also exist instabilities associated with the magnetofluid-vacuum interface. The 
energy ÔW;nt can be thought of as the change in potential energy of a stretched membrane 
at the magnetofluid-vacuum interface where the stretching force is given by the difference 
between the vacuum and magnetofluid forces pushing on the membrane surface; instability 
will occur if 6bWin~ < 0. When investigating these surface instabilities it is convenient to 
set dW to zero by idealizing the plasma to being incompressible, having uniform internal 
pressure, and no internal currents. Surface instabilities can exist only if the surface can 
move, and so require a vacuum region between the wall and the plasma. Thus, moving a 
conducting wall right up to the surface of a conducting plasma prevents surface instabilities. 
These surface instabilities will be investigated in detail later in this chapter. 


10.5 Current-driven instabilities and helicity 


We shall now discuss current driven instabilities and show that these are helical in nature 
and are driven by gradients in Jo///Bo. To simplify the analysis P — 0 is assumed so that 
pressure-driven instabilities can be neglected since they have already been discussed. On 
making this simplification, Eq.(10.97) shows that the parallel component of the perturbed 
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magnetic field reduces to 
By = —Bo (26, -k +V- £]. (10.119) 
Thus, we may identify 
B? = Bl, + Bi = BY, + BSR, +V- E] (10.120) 


and so the perturbed potential energy of the magnetofluid volume reduces to 


1 J 
Wr =—— | drs B? +E, xB, acl f (10.121) 
2Ho Bo 
Equation (10.67) shows that the perturbed vector potential can be identified as 


A =x Bo (10.122) 
so that Eq.(10.121) can be recast as 


1 J 
ie fë { B2—A,- Be) . (10.123) 
2Ho Bo 


We now show that finite A, - Bı corresponds to a helical perturbation. Consider the 
simplest situation where A, -B, is simply a constant and define a local Cartesian coordinate 
system with z axis parallel to the local Bo. Equation (10.122) shows that Ay = Ay, ĉ + 
Ay so 


Ary 3A 


A,B, = —Atz + Ary (10.124) 
Suppose both components of A, are non-trivial functions of z and, in particular, assume 
Aja = Re Aj, exp(ikz) and Ay, = Re Aj, exp(ikz). In this case 
1 , OA, , Aiz k ar P 
Aı -Bı = Be Ate ee t Aly Bee | se [i (Aj Aty = AjyAiz)] 
(10.125) 


which can be finite only if Aj,Aiy is not pure real. The simplest such case is where 


Aiy = iAiz so 


A, - By = k|Ai,|? (10.126) 
and 
Ai = Re [Ais (4 +19) exp(ikz)] (10.127) 
which is a helically polarized field since Aj, ~ cos kz and Aj, ~ sin kz. 


10.6 Magnetic helicity 
Since finite A; - By corresponds to the local helical polarization of the perturbed fields, 


it is reasonable to define A - B as the density of magnetic helicity and to define the total 
magnetic helicity in a volume as 


K= | @raA-B. (10.128) 
V 


10.6 Magnetic helicity 321 


The question immediately arises whether this definition makes sense, i.e., is it reasonable 
to define A - B as an intensive property and K as an extensive property. An obvious 
problem is that A is undefined with respect to a gauge, since A can be redefined to be 
A' = A+Vf where f is an arbitrary scalar function without affecting the magnetic field, 
because V x A =V x (A+V/f). The above definition for magnetic helicity would be of 
little use if K depended on choice of gauge. However, if no magnetic field penetrates the 
surface S enclosing the volume V, the proposed definition of K is gauge-independent. This 
is because B-ds = 0 everywhere on the surface if no magnetic field penetrates the surface. 
If this is so, then 


[erarvy-B = f čraB + | &rv- (B) 


[trace + [as - (fB) 


| drA-B (10.129) 
V 


and so the helicity is gauge-independent even though the helicity density is gauge-dependent. 
Let us consider the situation where there is no vacuum region between the plasma and 

an impermeable wall. Thus the normal fluid velocity u, must vanish at the wall. From 

Ohm’s law the component of the perturbed electric field tangential to the wall Ey; is just 


E: = —Wy, x Bo = 0 (10.130) 


since Bo lies in the plane of the wall. Thus, an impermeable wall is equivalent to a con- 
ducting wall. If the magnetic field initially does not penetrate the wall, i.e. B-ds = 0 
initially, then the field will always remain tangential to the wall and the helicity K in the 
volume enclosed by the wall will always be a well-defined quantity (i.e., will always be 
gauge-invariant). 

The time derivative of the magnetic helicity density together with Faraday’s law gives 
a conservation equation for helicity density. This is seen by direct calculation: 


fô) OA OB 
—_—(A-B) = -_B_-A.- 
an | ) ot ot 
= (E+Vy)-B+A-VxE (19:131) 


= 2E-B+V-(yB+E~x A) 
where ọ is the electrostatic potential. Since the ideal MHD Ohm’s law gives E- B =0, 


Eq. (10.131) can be rearranged in the form of a helicity conservation equation 


<(A-B)+V-(pB+Ex A)=0. (10.132) 


Integration of Eq.(10.132) over the entire magnetofluid volume gives a global helicity con- 
servation relation 


Z [feras] + [ asn: (YB +E x A)=0. (10.133) 
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Since both B-n=0 and E; = 0 at the wall, this reduces to 


K= pera - B =const.; (10.134) 


i.e., the total helicity (integral of the helicity density) is conserved for an ideal plasma 
surrounded by a rigid or conducting wall. 

Now recall that A; is a linear function of € as given in Eq.(10.122) and that € was 
assumed to be of order €. Consider the implications of this result for perturbed magnetic 
fields. We write B = Bo + Bı and A= Ag + A, so that the total magnetic helicity, a 
constant, is 


K = f dr Ao -V x Ao = fr (Ao + Ar)-V x (Ao + At) 


f dr Ao -V x Ao + fd?r Ay -V x Ao 
+ fdr Ao -V x Ay + fdr Ay -V x At. 
(10.135) 
Thus, to first-order in the perturbation (i.e., to order €), 


[eran Vx Aot drao: Vx Ar =0 (10.136) 
and to second-order (i.e., to order €? which is what is relevant for the energy principle), 


fæ A, -VxA,= per A, -B, =0. (10.137) 


This can be compared to Eq.(10.123); the second term is almost the same as Eq.(10.137) 
except for a factor — 4o Joj / Bo which, in general, is some complicated function of position. 
However, in the special case where po Joj /Bo = const., it is possible to factor fig Joy / Bo 
out of the integral and obtain, using Eq.(10.137), 


1 
Wr = — J d?r B? (10.138) 
2Ho 


which is positive-definite and therefore gives absolute stability. Thus, equilibria which have 
LoJoj/Bo = A where A = const. are stable against current-driven modes. Since these 
equilibria are helical or kinked, they may be considered as being the final relaxed state 
associated with a kink instability —once a system attains this state no free energy remains 
to drive further instability. Since pressure has been assumed to be negligible, Eq.(10.60) 
implies that the equilibrium current must be parallel to the equilibrium magnetic field, 
but does not specify the proportionality factor. What has been shown here is that if the 
proportionality factor is spatially uniform, i.e., 


Lig Jo=ABo (10.139) 


where A does not depend on position, then the system is stable against any further helical 
perturbations. This gives rise to the relation 


V x Bo = ABo (10.140) 
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where A is spatially uniform. This equation is called the force-free equilibrium and its 
solutions are helical vector fields, namely fields where the curl of the field is parallel to 
the field itself. If a field is confined to a plane, then its curl will be normal to the plane 
and so a field confined to a plane cannot be a solution to Eq.(10.140). The field must be 
three-dimensional. 

To summarize, it has been shown that current-driven instabilities are helical and drive 
the plasma towards a force-free equilibrium as prescribed by Eq.(10.140). Current-driven 
instabilities are energized by the gradient of Jo /Bo and become stabilized when Jo) /Bo 
becomes spatially uniform. Gradients in Jo, /Bo can therefore be considered as the free 
energy for driving kink modes. When all the free energy is consumed, the kinks are stabi- 
lized and the plasma assumes a force-free equilibrium with spatially uniform Joj / Bo. 

This tendency to coil up or kink is a means by which the plasma increases its inductance. 
However, when the plasma coils up into a state satisfying Eq.(10.140) it is in a stable 
equilibrium. This stable equilibrium represents a local minimum in potential energy. There 
might be several such local minima, each of which has a different value as sketched in 
Fig.9.2. As mentioned earlier this set of discrete energy levels is somewhat analogous 
to the ground and excited states of a quantum system. Here, the vacuum magnetic field 
is analogous to the ground state, while the various force-free equilibria (i.e., solutions of 
Eq.(10.140)) are analogous to higher energy states. 


10.7 Qualitative description of free-boundary instabilities 


The previous section considered internal instabilities of a magnetofluid bounded by a rigid 
wall so that no vacuum region existed between the magnetofluid and the wall. Let us 
now consider the other extreme, namely a situation where not only does a vacuum region 
exist external to the magnetofluid, but in addition, the location of the vacuum-magnetofluid 
interface is not fixed and can move around. To focus attention on the effect of surface 
motions, the simplest non-trivial configuration will be considered, namely a configuration 
where the interior pressure is both uniform and finite. This corresponds to having VP = 
O in the interior so that the entire pressure gradient and therefore the entire J x B force is 
concentrated in an infinitesimally thin surface layer. 

The MHD energy principle showed that compressibility, manifested by having finite 
V - & is stabilizing. Therefore, if a given system is stable with respect to incompressible 
modes, it will be even more stable with respect to compressible modes, or equivalently, with 
respect to modes having finite V - €. Thus by assuming V - € =0, the worst-case scenario is 
considered and, in addition, the analysis is simplified. Finally, it is assumed that the plasma 
is cylindrical and uniform in the z direction (axially uniform). For example, the Bennett 
pinch would satisfy these assumptions if all the current were concentrated on the plasma 
surface. The physical basis of the two main types of current-driven instability, sausage and 
kink, will first be discussed before engaging in a detailed mathematical analysis. 


10.7.1 Qualitative examination of the sausage instability 


Consider a Bennett pinch (z-pinch), that is an axially uniform cylindrical plasma with an 
axial current and an associated azimuthal magnetic field. In addition, as discussed above 
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the pressure is assumed to be (i) uniform in the interior region r < a where a is the plasma 
radius and (ii) zero in the exterior region r > a. In equilibrium, the radially inward J x B 
pinch force balances the radially outward force associated with the pressure gradient so 
that —J.9Beo = OPo/Or where both sides of this equation are finite only in an infinitesi- 
mal surface layer at r = a. We now suppose that thermal noise causes the incompressible 
plasma to develop axially periodic constrictions and bulges as shown in Fig.10.6. At the 
constrictions the azimuthal magnetic field Bọ = uol /2ar becomes larger than in equilib- 
rium because r < a at a constriction whereas J is fixed. Thus, at the constriction the pinch 
force ~ B3/r is greater than its equilibrium value and so exceeds the outward force from 
the internally uniform pressure. The resulting net force will be inwards and will cause a 
radial inwards motion which will enhance the constriction. Because the configuration is 
assumed incompressible, any plasma squeezed inwards at constrictions must flow into the 
interspersed bulges shown in Fig.10.6. The azimuthal magnetic field at a bulge is weaker 
than its equilibrium value because r > a at a bulge so the tables are now turned in the 
competition between outward pressure and inward pinch force — at a bulge the pressure 
exceeds the weakened pinch force and this force imbalance causes the bulge to increase. 
Hence any initial perturbed combination of constrictions and bulges will tend to grow and 
so the system is unstable. This behavior is called the ‘sausage’ instability because what 
results is a plasma resembling a string of sausages. 


current 


Figure 10.6: Sausage instability, current is axial, magnetic field is azimuthal 
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Figure 10.7: Kink instability can occur when the equilibrium magnetic field is helical 


The sausage instability can be prevented by surrounding the plasma with a perfectly 
conducting wall or by immersing the plasma in a strong axial vacuum magnetic field gen- 
erated by currents in external solenoid coils coaxial with the plasma. If a plasma with 
embedded axial field attempts to sausage, the axial rippling of the plasma would have to 
bend the embedded axial field since this axial field is frozen into the plasma. Any bending 
of a magnetic field requires work and so the free energy available to drive the sausage insta- 
bility would also have to do work to bend the axial field. If bending the axial magnetic field 
absorbs more energy than liberated by the sausaging motion, the plasma is stabilized. The 
axial field stabilizes the plasma in a manner analogous to a steel reinforcing rod embedded 
in concrete. The other method for stabilization, a close-fitting conducting wall, works be- 
cause image currents induced in the wall by the sausage motion produce a magnetic field 
which interacts with the plasma current in such a way as to repel the plasma away from the 
wall. 


10.7.2 Qualitative examination of the kink instability 


The axial magnetic field B, combined with the azimuthal magnetic field Bg produced by 
the plasma current results in a magnetic field that is helical. This helical magnetic field is 
susceptible to the helical kink instabilities sketched in Fig.10.7. At the concave parts of 
the plasma surface there is a concentration of the azimuthal field resulting in a magnetic 
pressure that increases the concavity. Similarly, at the convex portions of the surface, the 
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azimuthal field is weaker so that the convex bulge will tend to increase. This is just the 
hoop force trying to increase the major radius of a curved current. If the externally imposed 
axial magnetic field is sufficiently strong, the energy required to bend the external field will 
exceed the free energy available from the kinking and the instability will be prevented. 


10.7.3 Qualitative examination of wall stabilization 


Now consider a plasma with no externally produced B, field but surrounded by a close- 
fitting perfectly conducting wall. The plasma axial current generates an azimuthal field in 
the vacuum region between the plasma and the wall. If an instability causes the plasma to 
move towards the wall, then the vacuum region becomes thinner and the magnetic field in 
the vacuum region becomes compressed since it cannot penetrate the wall or the plasma. 
The increased magnetic pressure acts a restoring force, pushing the plasma back away 
from the wall. Thus, a close-fitting perfectly conducting wall can stabilize both kinks and 
sausages. 


perfectly conducting 
wall at r = b 
surface 
current 
Pressure region 
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uniform 
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plasma vacuum 
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Figure 10.8: Equilibrium for free-boundary stability analysis: assumed cross-section (left) 
and assumed pressure profile (right). 


10.8 Analysis of free-boundary instabilities 


Consider the azimuthally symmetric, axially uniform cylindrical plasma of radius a shown 
in Fig.10.8. The plasma is subdivided into two concentric regions consisting of (i) the in- 
terior region 0 < r < a_ and (ii) a thin surface layer a- <r < ax. In the limit of 
infinitesimal surface thickness, a_ approaches a1. The interior pressure Po is uniform and 
the interior current density is zero so that both J x B and V P are zero in the interior. Thus, 
a finite current density exists only in the surface layer and everywhere else the magnetic 
field is a vacuum magnetic field. The plasma is also assumed to be surrounded by a per- 
fectly conducting wall located at a radius b where 6 is larger than a. The MHD equilibrium 
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can be written as > 


B 
0 = — uV P v( 2) + Bo - VBo. (10.141) 


Because V - Bo = r-'0/0r (rBo,) = 0 for an azimuthally symmetric, axially uni- 
form system and because B, cannot be singular, Bo, must be zero everywhere. Since all 
equilibrium quantities depend only on r, the third term in Eq.(10.141) can be expanded 


Bo - VBo = (Boo(r)9 + Bo-(r)2) f V (Boo(r)6 + Bo-(r)2) 


TA (10.142) 
= B2,0- Vb = -L F. 
r 
Thus, Eq.(10.141) has only r components and reduces to the relation 
EE LE Bos (10.143) 
=ar Gr V2 ro l 
On integrating across the surface layer, this becomes 
Bosz Beg + Be 
YB go E nia (10.144) 


2Ho 24o 
where the subscript p refers to being at the inner radius (plasma side) of the surface layer and 
the subscript v refers to the outer radius (vacuum side). The reason there is no Bopa 
term on the left hand side is that there are no interior plasma currents (from Ampere’s 
law 27aBopo = Lolz = 9). 
The equilibrium surface can be described by the function r = a or equivalently, the 
surface-defining equation 


So(r) =r—a=0. (10.145) 
The incompressible perturbation is characterized by a harmonic deformation of the plasma 
surface such that r = a + €e'9+i**_ The surface equation for the perturbed surface is 
thus 
S(r,0,2z) =r — a — elm tikz — 0 (10.146) 
where without loss of generality it is assumed that € > 0. The equilibrium magnetic field 
had Bor = 0, i.e., had no component normal to the surface, and so the equilibrium magnetic 
field lines at the surface lie in the surface. This is equivalent to stating that Bo - VSo = 0 
which implies that the equilibrium magnetic field is tangential to the surface. Since the 
magnetic field is assumed frozen into the plasma, the magnetic field must continue to be 
tangential to the surface even when the surface becomes deformed from its equilibrium 
shape. Thus, the condition 
B-VS=0 (10.147) 
must be satisfied at all times where V.S is in the direction normal to the surface; this is 
essentially a statement that no magnetic field line penetrates the surface. 
A quantity f at the perturbed surface (denoted by the subscript ps) can be expressed 
as 
ps = fo + fi +E-V fo (10.148) 
where the the middle term is the absolute first-order change and the last term is the convec- 
tive term due to the motion of the surface. 
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If a configuration is to be stable, any perturbation must generate a restoring force which 
pushes the perturbed surface back to its equilibrium location. The competition between 
destabilizing and restoring forces is found by integrating Eq.(10.141) across the perturbed 
surface. This integration gives Eq.(10.144) evaluated at the perturbed surface, i.e., using 
Eq.(10.148) to take into account the effect of the perturbation of the surface. Since € was 
assumed positive (outward bulge at 0 = 0, z = 0) the perturbed system will be stable if the 
effective pressure on the outside of the perturbed surface exceeds the effective pressure on 
the inside, i.e., if 


B2, + B2 BŽ, 
| at =] > |P+ 5 — stable. (10.149) 
Lo perturbed sfc Ho perturbed sfc 


In this case, the restoring force pushes back the bulge and makes the system revert to its 
equilibrium condition. Subtracting the equilibrium pressure balance relation, Eq.(10.144), 
this becomes 


Bovo Biwo + Bovz Biz z [Pee t] > Pi i BopzPipz E ð P Bopz 
Ho T 2 Ho Ho ðr 2Ho 
(10.150) 


where all quantities are evaluated at r = a. 
Equation (10.150) can be simplified considerably because of the following relation- 
ships: 
1. There are no currents in either the plasma interior or the external vacuum so 


3 Bopz OBoy z 


= = 0. 10.151 
Or Or ( ) 
2. The adiabatic pressure equation gives 
Pp+EVP+7P/V-E=0. (10.152) 


Since incompressibility V - € = Ohas been assumed, the adiabatic relation reduces 
to 


P, +€& OPo/Or=0. (10.153) 
3. From Ampere’s law it is seen that B>,g ~r~? so [OB5,,4/Ar] ead —2 B2 ,/a; thus 


Eq.(10.150) simplifies to 


BovePive F BovzBivz = É BR > BopzP pz => stable (10.154) 


where again all fields are evaluated at r = a. 
To simplify the algebra all fields are now normalized to Bovo (a). The normalized fields 
are denoted by a bar on top and are 


Bo Bovz Bo Bopz 
g Bovo (a) f g Boola) 
B En Biyz D ran Biwo = = Bipz (10.155) 
luz = 1v0 = 
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so that, when expressed as a relationship between normalized fields, Eq.(10.154) becomes 


Big N E pope, =e (10.156) 
a 


What remains to be done is express all first-order magnetic fields in terms of €. This is 
relatively easy because the current is confined to the surface layer so the magnetic field is 
a vacuum magnetic field everywhere but exactly on the surface. 

Since B is a vacuum magnetic field for both O < r < aanda < r < b, it must be of the 
form B =Vx where V7 = 0 in both regions, but there could be a jump discontinuity at 
r = a. Because the perturbed fields were assumed to have an exp(im0@ + ikz) dependence, 
the Laplacian becomes 


2 2 
E EON) (z ' r) x(r)=0 (10.157) 


dr? r dr r2 


which is just the equation for modified Bessel functions, Jm;(|k|r) and Kim;(|k|r). Ab- 
solute value signs have been used to avoid possible confusion later for situations where m or 
k could be negative. The J),,,(|K|7) function is finite at r = 0 but diverges at r = co while 
the opposite is true for K),,,|(|k|r). 

The surface current causes a discontinuity in the tangential components of the perturbed 
magnetic field. Hence, different solutions must be used in the respective plasma and vac- 
uum regions and then these solutions must be related to each other in such a way as to 
satisfy the requirements of this discontinuity. The forms of the plasma and vacuum solu- 
tions are, in addition, constrained by the respective boundary conditions at r = O and at 
r = b. In particular, the K),,)(|k|r) solution is not allowed in the plasma because of the 
constraint that the magnetic field must be finite at r = 0; thus inside the plasma the solution 
is of the form 

X = al\m|(|Alr) in plasma region O < r < a_. (10.158) 
On the other hand, both the Jm;(|k|r) and Kmį(|k|r) solutions are permissible in the 
vacuum region and so the vacuum region solution is of the general form 
X = bilmi (kr) + 82Kmi(|k|r) in vacuum regiona} <r <b. (10.159) 
The objective now is to express the coefficients a, 64, and 6 in terms of €so that all 
components of the perturbed magnetic field can be expressed as a function of €, both in the 
plasma and in the vacuum. 

The functional dependence of these coefficients is determined by considering boundary 
conditions at the wall and then at the plasma-vacuum interface: 

1. Wall: Since the wall is perfectly conducting it is a flux conserver. There is no radial 
magnetic field initially and so there is no flux linking each small patch of the wall. 
Thus Bior (b) must vanish at the wall in order to maintain zero flux at each patch of 
the wall. Using Eq.(10.159) to calculate the perturbed radial magnetic field at the wall, 
this condition means that 


By Ty (11D) + 82K im (lk Ib) = 0 (10.160) 
which may be solved for 3, to give 


Bo = —By 


ï 
deL, (10.161) 


K 


[m| 
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here the circumflex means the modified Bessel function is evaluated at the wall, i.e., 
with its argument set to |k|b. Equation (10.159) can then be recast as 


X = B |D m (RIP) Elin) — Fim Kim (klr) (10.162) 


where 6 = 6,/K In|" The wall boundary condition has reduced the number of inde- 
pendent coefficients by one. 


2. Vacuum side of plasma-vacuum interface: Here Eq.(10.147) is linearized to obtain 


B,-VS0 + Bo- VS; =0 (10.163) 
or using Eqs.(10.145) and (10.146) 
Bison (Z + kBos-)E = 0. (10.164) 
On the vacuum side Eq.(10.162) gives 
Bier = (RIB T Ra Taa (10.165) 


where omission of the argument means that the modified Bessel function is evaluated 
at r = a. Substitution of Eq.(10.165) into Eq.(10.164) gives 


i(= + kBoz) 
a 


Ikl aye 


(10.166) 


|m| [m| |m] 


5 £ 
= Fh K ir 
so the complete vacuum field can now be expressed in terms of £. 


3. Plasma side of plasma-vacuum interface: The plasma-side version of Eq.(10.163) 
gives 
Bipr — ikBopz£ = 0 (10.167) 
since Bopo vanishes inside the plasma. From Eq.(10.158) the perturbed radial mag- 
netic field on the plasma side of the interface is 


Bipr = |klOL in|: (10.168) 
The plasma side version of Eq.(10.164) is thus 
ik Bopz 
= 10.1 
a IKI ° (10.169) 


and so the plasma fields can now also be expressed in terms of £. 
The stability condition, Eq.(10.156), can be written in terms of a and 6 to obtain 


b (= + ik ) (Firat Kon — Fini Kim) = : > aBopzikI m]. (10.170) 


Substituting for œ and 6 and re-arranging the order gives 


k|aB? = x 
| | Opz I |kla T R ang K 


[m| 


Lin kaBow 2)? | Lm Elm) — fim Kim 
| (m + kaBovz) | i [ml kel ; L| > 1 = stable. (10.171) 
|r| Im] Imit im] 
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If we introduce the normalized pressure 


F 2uo Po 
P = — (10.172) 
Bĝ,o(a) 


the MHD equilibrium, Eq.(10.144), can be written in normalized form as 


Po + Boys = 1 + Boyz. (10.173) 
Substitution for Bõpz into Eq.(10.171) and rearranging the order of the second term gives 


|kla [1 + Boy. — Pol >1 


J! |kla r RÙ 


Im] [m| |m| 


|m| | _ (m+ kaBouz)? Face + Lin Km 
f Fi ~ 
|m| 
(10.174) 
which is the general requirement for stability of a configuration having a specified internal 
pressure, vacuum axial field, plasma radius, and wall radius. 


Conditions for sausage and kink instability | Let us now examine the effect of the 
various factors in Eq.(10.174). For large argument, the modified Bessel functions have the 
asymptotic form 


lim IJim(s)—e*; lim Kjmj(s) > e* (10.175) 
so if the wall radius goes to infinity, the factor 
-Iim in ar nK im] Kimi 
i Ki = i Ki | > Ki = positive definite. (10.176) 


Since bringing the wall closer is stabilizing, the factor 


-I m Kim, + La) 
I K -I K 


|m| 


1 1 4 

|m| [m| |m] 

will always be positive-definite. In particular, if b — a then J mÉ zal >Í press al so that 
the wall stabilization becomes arbitrarily large. 

As |k|a — oo the left hand side of Eq.(10.174) becomes infinite. Thus, the configura- 
tion is stable with respect to modes having short axial wavelengths. The reason for this is 
that short wavelength perturbations cause more bending of the magnetic field than a long 
wavelength perturbation and so require more energy. 

We therefore focus attention on modes with a long axial wavelength (i.e., modes with a 
small k) since these offer the only possibility of instability. The analysis can be subdivided 
into specific cases, such as m = 0, |m| = 1, |m| > 1, close-fitting wall, no wall, low 
pressure, high pressure, etc. Before getting into the details let us take a broader look at the 
effect of the various terms in Eq. (10.174). Since Jm] /I Imi > 0 we see that increasing P 
is destabilizing, while increasing B3,,, is stabilizing. Also, if m + kaBo,, = 0 the second 
term vanishes, leading to reduced stability; by defining the wavevector as k = (m/ a)ĝ+kż2 
it is seen that m + kaBosz = 0 corresponds to having k - Bo = 0 at the surface. 
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Sausage modes 
The m = 0 modes are the sausage instabilities and here Eq. (10.174) reduces to 


_ aca Nod -> | Slee + ÑK 1 
[1 + B3,. — Po | J ppa E o g ae D 
To I Ko — [646 Ikla 


For a given normalized plasma pressure and normalized wall radius b/a, this expression 
can be used to make a stability plot of B, , versus |k|a. Since the wall always provides 
stabilization if brought in close enough, let us consider situations where there is no wall 
(i.e., b — 00) in which case the stability condition reduces to 


a ~, [I - -K 1 
| 2 0 | 2 TANO. = 
[1+ Bbs- — Po] H + Boye | Ki | > Ka = stable. (10.178) 
For small arguments, the modified Bessel functions of order zero have the asymptotic val- 
ues 


2 
h(s) S14 = Kols) = -ms (10.179) 


so the stability criterion becomes 


Boyz [1 — k?°a? In(|k]a)] > P = stable. (10.180) 


This gives a simple criterion for how much Bĝ,, is required to stabilize a given plasma 
pressure against sausage instabilities. The logarithmic term is stabilizing for |kļa < 1 
but is destabilizing for |k|a > 1; however this region of instability is limited because we 
showed that very large |k]a is stable. 

Kink modes 

The finite m modes are the kink modes. It was shown earlier that large |k|a is stable so 
again we confine attention to small |k|a. In addition, we again let the wall location go to 
infinity to simplify the analysis. The stability condition now reduces to 


Ikla [1 + Boy. — P] 


in |kla Ki 


|m| |m| 


m kaBovz)? —K\m 
| | p MRA Doua) | | | > 1— stable. (10.181) 


For m Æ 0 the small argument asymptotic asymptotic limits of the modified Bessel func- 
tions are 


1 Im] — 4]! ysy -iml 
Timj(s) = Ta (5) > Kimj(s) © mit (5) (10.182) 


so the stability condition becomes 


ka? [1+ Bi, — Po] + (m + kaBoyz)” > |m| => stable (10.183) 


which is a quadratic equation in ka. Let us consider plasmas where B2,, >> 1 and 
B2,- >> Py; this corresponds to a low beta plasma where the externally imposed axial 
field is much stronger than the field generated by the internal plasma currents (tokamaks 
are in this category). Let us define 


x = kaBovz (10.184) 
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so the stability condition becomes 


x? +(m+-2)? > |m| = stable. (10.185) 

Without loss of generality x can assumed to be positive, in which case instability occurs 
only when m is negative. Thus, Eq.(10.185) can be written as 

2x? — 2|m|x +m? — |m| > 0 = stable. (10.186) 


The threshold for instability occurs when the left hand side of Eq. (10.186) vanishes, i.e., 
at the roots of the left hand side. These roots are 


+ ,/2 — m? 
ERE - Ha (10.187) 


Since the left hand side of Eq.(10.186) goes to positive infinity for |x| — oo, the left hand 
side is negative only in the region between the two roots. Thus, the plasma is unstable only 
if |x| lies between the two roots. The stability condition is that |x| must lie outside the 
region between the two roots, i.e., for stability we must have 


[m| + /2|m| — m? 
T> or 


10.1 
DENTE ee 


< 
i 2 
For m = —1 modes this gives the stability condition 
x> I. (10.189) 


In un-normalized quantities and using k = 27/L where L is the length of the system, the 
stability condition is 


this is known as the Kruskal-Shafranov stability criterion (Kruskal, Johnson, Gottlieb and 
Goldman 1958, Shafranov 1958). 

For m = —2 modes, the two roots coalesce at x = 1 and so Eq.(10.189) also gives 
stability. For m > 3 the argument of the square root in Eq.(10.187) is negative so there is 
no region of instability. 

In toroidal devices such as tokamaks, the axial wavenumber corresponds to the toroidal 
wavenumber since the dominant magnetic field is in the toroidal direction, i.e. Boz — Bog 
where œ is the toroidal angle. The axial wavenumber k becomes n/R where n is the 
toroidal mode number. Since we saw that long axial wavelengths are the most unstable we 
take n = 1. The Kruskal-Shafranov kink stability condition becomes 


_ 2 Bog 
1= RBo 


where q is called the safety factor. Typical tokamaks operate with q ~ 3 at the wall; this 
q condition is one of the most important design criteria for tokamaks since it dictates the 
magnitude of the large and expensive toroidal field system. 


>1 (10.190) 
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10.9 Assignments 


Interchange instabilities and volume per unit flux- Another way to consider inter- 
change instabilities is to calculate the consequence of interchanging two flux tubes 
having the same magnetic flux. Doing so will not change the magnetic energy since 
the magnetic field is unchanged by this interchange. However, if the flux tubes con- 
tain finite-pressure plasma and the volumes of the two flux tubes differ, then the inter- 
change will result in compression of the plasma in the flux tube which initially had the 
larger volume and expansion of plasma in the flux tube which initially had the smaller 
volume. The former requires work on the plasma and the latter involves work by the 
plasma. If the net work must be done on the plasma to effect the interchange, then 
the interchange is stable and vice versa. In a magnetic confinement configuration, the 
high pressure is by assumption on the inside of the configuration and the low pressure 
is on the outside. Thus, the question is whether interchanging a high pressure, inner 
region flux tube with a low pressure out region flux tube requires positive or negative 
work. 


(a) Show that the volume per unit flux in a flux tube is given by 


vege 
B 

where the contour is over the length of the flux tube. Hint: Consider that BA = 

const. on a flux tube. 


(b) Show that instability corresponds to V’ increasing on going outwards. 


(c) Consider the magnetic field external to a current-carrying straight wire. How 
does V’ scale with distance from the wire and would a plasma confined by such 
a magnetic field be stable or unstable to interchanges. Is this result consistent 
with the concepts of good and bad curvature? 


. Work through the algebra of the magnetic energy principle and verify Eqs.(10.113), 


(10.114), (10.115), and (10.116). 


. Show that the force on a plasma in an arched magnetic field with fixed ends tends 


to push the plasma towards the shape of a vacuum magnetic field arch having the 
same boundary conditions. Under what circumstances will the force (i) increase the 
magnetic field arch major radius, (ii) decrease the magnetic field arch major radius, 
(iii) leave the magnetic field arch major radius unchanged. Show that, in contrast, the 
force on a plasma containing an arched current always tends to increase the major 
radius of the arched current. 


Show that if the wall radius b — oo, then Eq.(10.174) reduces to the condition 


ze] (m + 2Bovz)” Km(2) 1 > 0 for stability 


Ta (2) z Ki, (2) 
where x = ka. Assume that Bove = Bosz and use a computer to evaluate the modified 


Bessel functions numerically. Make a plot of this expression in a parameter space 
where the vertical axis is x and the horizontal axis is Bove/Bovz = 1/Bovz and in 


puz 
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particular, indicate by shading regions where F < 0. Do this for both m = 0 and for 
m = —1so as to show the regions of the parameter space where kinks and sausages 
are unstable. For a given k show on the plot how raising the axial current would lead 
to kink or sausage instability. Which happens first (kink or sausage)? 


. Are kink instabilities diamagnetic or paramagnetic with respect to the axial field? 
To find this, consider the kinked current as a solenoid and determine whether the 
orientation of the solenoid is such as to add or subtract from the initial B, field. 


. According to Eq.(10.139) current driven instabilities cause the plasma to relax to a 
situation where uoJ = AB. 


(a) Show that a class of solutions to this can be found if it is assumed that B has the 
form 
B=AVy x 2+V x (Vy x 2) 
provided V?y + A? Y = 0. 
(b) Suppose that y(x, z) = f(z)cos(kx). What is the form of f(z)? Show that 
solutions which decay in z are only possible for a certain range of A^. 
(c) Consider a two dimensional force-free solar coronal loop. Let the z = 0O plane 


denote the solar surface. Calculate the functional form of B, and B, using the 
solution in part (a) above. 


(d 


wm 


Sketch the shape of the field line going from x = —7/k to x = +77/k in the 
z = 0 plane. Do this for a sequence of increasing values of d?. What happens to 
the shape of the field line as A? is increased? How is current related to à? 


11 


Magnetic helicity interpreted and 
Woltjer-Taylor relaxation 


11.1 Introduction 


The previous chapter introduced the concept of magnetic helicity by using arguments de- 
riving from the energy principle and showed that global helicity K = f @PrA-Bisa 
conserved quantity in an ideal plasma. We show in this chapter that magnetic helicity has 
an important topological interpretation and furthermore that the conservation property of 
global helicity is more robust than magnetic energy conservation. By robust, it is meant 
that when some dissipation exists and the plasma is no longer ideal, helicity conservation 
holds up much better than magnetic energy conservation, i.e., helicity is nearly conserved 
even though energy is not conserved. This difference in the relative loss rates of energy and 
helicity provides the basis for the Woltjer-Taylor theory which predicts a minimum-energy, 
conserved-helicity final state which any arbitrary initial state will always relax towards. 
Relaxation theory has been very successful for predicting the general behavior of many 
laboratory, space, and astrophysical plasmas. Finally, it will be shown that helicity can be 
manifested in more than one way and in particular that the kink instability can be consid- 
ered as a conversion of helicity from one form (twist) to another form (writhe). 


11.2 Topological interpretation of magnetic helicity 


11.2.1 Linkage helicity 


Consider the two thin linked untwisted flux tubes labeled as flux tube #1 and flux tube #2 
shown in Fig. 11.1. The respective fluxes are ©, and ®ə and the tube axes follow the 
contours C and C. and the tube volumes are V; and V2. The magnetic field is assumed to 
vanish outside of the flux tubes. 

The helicity of the volume V containing the two linked flux tubes is 


K= | A-Bd?’r. (11.1) 
V 
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Because B vanishes outside of the two flux tubes, the helicity integral is finite only in the 
volumes V; and V2 of the two tubes and so the helicity can be expressed as (Moffatt 1978) 


K= | A-Bd’r+]/ A-Bd?r. (11.2) 
Vi V2 
The contribution to the helicity from integrating over the volume of flux tube #1 is 


K,= | A- Bar. (11.3) 
Wi: 
In order to evaluate this integral it is recalled that the magnetic flux through a surface S 
with perimeter C' can be expressed as 


v= | Bas= g A-dl. (11.4) 
Ss E 


In flux tube #1, d?r = dl- BAS, where AS is the cross-sectional area of flux tube #1 and dl 
is an element of length along C4. It is therefore possible to recast the integrand in Eq.(11.3) 
as 


A-Bdr = A-Bdl-BAS 
A-dl®, (11.5) 


since dl is parallel to B and BAS = ®,. Because ®, is by definition constant along the 
length of flux tube #1, it may be factored from the Kı integral, giving 


Kı = ®ı A-dl. (11.6) 
Cı 
However, the flux linked by contour C is precisely the flux in tube #2, i.e., JS A A-dl = v 
and so 
Kı = $1 8p. (11.7) 
The same analysis applied to flux tube #1 leads to K2 = ®,®2 and so the helicity content 
of the linked flux tubes is therefore 


K = Kı + Ko = 201 02. (11.8) 


P a 


flux ®ı flux D2 
contour Cı contour C2 
volume V; volume V2 


Figure 11.1: Two linked thin untwisted flux tubes. Magnetic field is zero outside the flux 
tubes. 
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The flux tubes did not actually have to be thin: a fat flux tube #2 could be decomposed 
into a number of adjacent thin flux tubes in which case K, would be ®, times the sum of 
the flux in all of the thin #2 flux tubes. The helicity is thus just the sum of flux tube linkages 
since if the two flux tubes each had unit flux, there would be one linkage of flux tube #1 with 
flux tube #2 and one linkage of flux tube #2 with flux tube #1. As a generalization, if flux 
tube #2 wrapped around flux tube #1 twice, then the contributions would be Kı = 20,2 
and Kə = 2®, ə in which case the helicity would be K = 4, ® 3. This would correspond 
to two linkages of flux tube #2 on flux tube #1 and two linkages of flux tube #1 on flux tube 
#2. 


flux tube #1 


of flux tube #2 


flux tube #2 


Figure 11.2: Flux tube #2 deformed so that it tightly encircles flux tube #1 and its 
cross-section is squeezed and stretched so as to uniformly cover flux tube #1 like a coat 
of paint (shaded area). 


11.2.2 Twist helicity 


Now suppose that the major radius of flux tube #2 is first shrunk until flux tube #2 tightly 
encircles flux tube #1 and then as sketched in Fig.11.2, the cross-section of flux tube #2 is 
both squeezed and stretched until its field lines are uniformly distributed over the length of 
flux tube #1; the field lines of flux tube #2 are shown shaded in Fig.11.2. The result is that 
the volume of flux tube #2 is like a thin coat of paint applied to flux tube #1 (shaded region 
in figure). Let P denote the flux in flux tube #1 and dw denote the flux in flux tube #2 so, 
using Eq.(11.8), the helicity in this configuration is seen to be 


dK = 20dy). (11.9) 
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We next add another layer of ‘paint’ with more embedded y flux, and also with embed- 
ded ® flux so that both 7 and ® increase. The value of ® can be used to label the layers 
of ‘paint’ so that ® is the amount of flux in flux tube #1 up to the layer of ‘paint’ labeled 
by ®. Furthermore, since ~ increases with added layers of ‘paint’, Y must be a function of 
the layer of ‘paint’ and so Y = u(®). It is therefore possible to write dy = w'd® where 
w' = dw/d®. Thus, the amount of helicity added with each layer of ‘paint’ is 


dK = 20 y'd® (11.10) 


and so the sum of the helicity contributions from all the layers of ‘paint’ is 


roo) 
K=2f b y'do. (11.11) 
0 


We now show that 2)’ represents the twist of the embedded magnetic field. Let œ be the 
angle the long way around flux tube #1 and 0 be the angle the long way around flux tube 
#2 as shown in Fig.11.2. Thus increasing ¢ is in the direction of contour C4 and increasing 
0 is in the direction of contour C2. The perimeter of a cross-section of flux tube #1 is in the 
0 direction and the perimeter of a cross-section of flux tube #2 is in the ¢ direction. The 
magnetic field in flux tube #1 can be written as 

1 


1 
Bı = —V® x VO = — V x V0 (11.12) 
2T 2T 


which is in the @ direction since V® is orthogonal to V0. Here © is the flux linked by a 
contour going in the direction of V0. To verify that this is the appropriate expression for 
B,, the flux through the cross section 5; of flux tube #1 is calculated as follows: 


flux through Sı = f os-Bi 

Sı 
1 

So ds-V x ®V@ 
T Sı 
1 

= — dl-®V@ 
2T Jo, 
p 

— dl-V@ 
2T Jo, 
p 

= — o~dé 
2T 

= ð. (11.13) 


The flux ® can be factored from the integral in the third line above, because ® is the flux 
linked by contour C2 which goes in the direction of V0. Similarly, it is possible to write 


1 1 
B2 = — Vy x Vo = —V x VV (11.14) 
2T 2T 


which is in the 8 direction since Vv is orthogonal to V¢. Since B; is in the ¢ direction and 
B» is in the 0 direction, the total magnetic field can be written as B =B,¢ + B20 which is 
helical. 
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The twist of the magnetic field is defined as the number of times a field line goes around 
in the @ direction for one circuit in the ¢ direction. If dly is a displacement in the ¢ direction 
then 

dọ = |V¢| dlg (11.15) 
and similarly 

dé = |V6| dig. (11.16) 
The trajectory of a magnetic field line is parallel to the magnetic field so if dl is an increment 
along a magnetic field line then Bxdl =0 or 


dig dl 
L e, 11.17 
n (11.17) 


This means that 
dé do 


|V6| Bo |Vo| Ba” 
However Bg = |V¥||V¢| /27 and Bg = |V®| |VO| /27 so 


(11.18) 


do V0| Bo 
do Vo| By 
VOL Veil [Vol /2n 
Vo |V®| [VO] /27 


_ [vy 
= Wa; (11.19) 


Finally, if Y = (®) then Vy = W/V® where y'= dy/d® and so 


do, 
Say. 11.2 
ad Y (11.20) 


Thus, 0 increases w’ times faster than does ¢ and so if @ makes one complete circuit (i.e., 
goes from 0 to 27), then 0 makes 3)’ circuits (i.e., 0 goes from 0 to 2m4’). The number 


of times the field line goes around in the 0 direction for each time it goes around in the ¢ 
direction is called the twist 

T(®) =. (11.21) 
Hence Eq. (11.11), which gave the helicity when the ~ flux is embedded in the ® flux, can 
be expressed in terms of the twist as 


K= 2 | or(@ao. (11.22) 


and if the twist is a constant (i.e., T” = 0) then 


K = TẸ’. (11.23) 
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11.2.3 Conservation of magnetic helicity during magnetic reconnection 


Ideal MHD constrains magnetic flux to be frozen into the frame of the plasma. This 
means that the topology (connectedness) of magnetic field lines in a perfectly conducting 
plasma cannot change. As will be shown in Chapter 12, introduction of a small amount 
of resistivity allows the frozen-in condition to be violated at locations where the velocity 
must vanish due to symmetry. At these locations, the approximation of the resistive MHD 
Ohm’s law E+ U x B = ņJ to the ideal form E+ U x B = O necessarily fails. In- 
stead, Ohm’s law has the form E = ņJ so that magnetic field lines can diffuse across the 
plasma and reconnect with each other. Even though the reconnection is localized to a small 
region in the vicinity of where U vanishes, it nevertheless changes the overall magnetic 
field topology. As will be shown below, reconnection destroys individual linkages between 
flux tubes, but creates a replacement linkage for every destroyed linkage so that the total 
system helicity is conserved. The result is that the ideal MHD constraint of having per- 
fect flux conservation everywhere is replaced by the somewhat weaker constraint that total 
magnetic helicity is conserved. Reconnection necessarily involves energy dissipation since 
reconnection requires finite resistivity. Thus reconnection dissipates magnetic field energy 
while conserving magnetic helicity. 

Conservation of helicity during reconnection is demonstrated in Fig. 11.3. Two linked 
untwisted ribbons of magnetic flux are shown in 11.3(a). If each ribbon is imagined to be 
a magnetic field line bundle having nominal flux ®, then according to Eq.(11.8) this initial 
configuration has helicity K = 27. The ribbons are then cut at their line of overlap as 
in Fig.11.3(b) and reconnected as in Fig.11.3(c) to form one long ribbon. This long ribbon 
in Fig.11.3(c) may then be continuously deformed until in the shape shown in Fig.11.3(f), 
a long ribbon having two twists. Thus, the twist parameter for Fig.11.3(f) is T = 2 and 
so according to Eq.(11.23) this two-twist ribbon has a helicity K = T®? = 27. Hence, 
magnetic helicity is conserved by the reconnection. Since the magnetic equivalent of cut- 
ting ribbons with scissors involves the dissipative process of resistive diffusion of magnetic 
field across the plasma, it can be concluded that magnetic reconnection conserves helicity, 
but dissipates magnetic energy. 


11.3 Woltjer-Taylor relaxation 


Woltjer (1958) provided a mathematical proof that the lowest energy state of a magnetic 
system with a fixed amount of magnetic helicity was a certain kind of force-free state, but 
did not provide any detailed explanation on how the system would attain this state. Taylor 
(1974) argued that because reconnection conserves helicity while dissipating magnetic en- 
ergy, reconnection events will provide the mechanism by which an isolated plasma relaxes 
towards a state having the lowest magnetic energy consistent with conservation of helicity. 
The discussion of Eq.(10.133) showed that the isolation requirement corresponds to having 
no field lines penetrate the surface bounding the plasma and also arranging for this surface 
to be an electrostatic equipotential. It will therefore be assumed that the plasma is bounded 
by a perfectly conducting wall and that no magnetic field lines penetrate this wall. The 
tangential electric field and hence the tangential component of the vector potential must 
therefore vanish at the wall. 
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Figure 11.3: (a) Two linked, untwisted flux tube squeezed to be ribbons, magnetic field 
directions shown by arrows; (b) cut at overlay of two ribbons; (c) reconnection at cut; (d) 
deformation; (e) more deformation; (f) more deformation showing there are two complete 
twists of magnetic field. 
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The relaxation process involves minimizing the magnetic energy 


_ 1 243 
W= Brd’r (11.24) 
2Ho 


subject to the constraint that the magnetic helicity 


K= | A-Be’r (11.25) 


is conserved (Woltjer 1958, Taylor 1974). The minimum-energy magnetic field for the 
given helicity is called B y g and its associated magnetic energy is 


1 , 
WME = Do J B? gd’r. (11.26) 
0 


In order to determine B mp, we consider an arbitrary variation B = Bmp + OB satis- 
fying the same boundary conditions; by assumption, this variation has a higher associ- 
ated magnetic energy than B mg. The magnetic field B has an associated vector potential 
A = Aye+0A and because the tangential electric field must vanish at the wall to prevent 
helicity flux across the wall, the tangential component of 6A must also vanish at the wall. 
A naive solution for minimizing Wm g would be to set B mpg to zero, but this is forbidden 
because it would make K vanish and violate the helicity conservation requirement. Thus 
W must be minimized subject to the constraint that kK remains constant. The variation of 
the magnetic energy about the minimum energy state is 


1 
2Ho 
1 3 
Eru Bmg:Bd T 

Ho 


ôW 


if (Bare + 6B)? — Bi, p| dr 


1 
— | Bue: Vxsadr 
Ho 
a8 
Ho 
1 
— [as Bue x 5A4 | 5A- Y x Burd’r 
0 


[V $ (BmE x ôA) + ôA- V x Bw] er 


= po V x Byrrd?r (11.27) 


since ds- Bmp x 6A =0 everywhere on the wall. The variation of the helicity is 


ôK 


foa f BME oF Awer-6B) dr 


= [OA Bue + Awe-V x 6A) Br 


= [OA Bue + 6A-V x Awe) +V- (6A x Aye) d?r 


= 2 [6A Bue dr. (11.28) 
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Minimization of 6W subject to the constraint that K remains constant is characterized 
using a Lagrange multiplier so that the constrained variational equation is 


OW = 6K. (11.29) 
Substitution for 6bW and ôK gives 


[5A Y x Burr =2) | 6A-Bure dr (11.30) 


or, after re-defining the arbitrary parameter A, 


fos (V x Bugz — àBmpr) d?r = 0. (11.31) 
Since ôA is arbitrary, the quantity in parentheses must vanish and so 


Vx Bye = ABme. (11.32) 


It is clear that À is a spatially uniform constant because the Lagrange multiplier is a con- 
stant. Thus, relaxation leads to the same force-free state as predicted by Eq.(10.140). These 
states are a good approximation to many solar and astrophysical plasmas as well as certain 
laboratory plasmas such as spheromaks and reversed field pinches. 

The energy per helicity of a minimum energy state can be written as 


W 2 f Birgar 
K 2o | Ave-Bugd3r 


JBumeV x Amgd?r 
2o | Awe-Bupd?r 
f [Ame V x B+V: (Aue x Bwe)| dr 


= ; 11. 
2o f A - Bar ee) 
However, Eq. (11.32) can be integrated to give 
Bme =AAme+Vf (11.34) 


where f is an arbitrary scalar function of position. This can be used to show 


V- (Ame X Bue) = V-(Ame X Vf) = Vf-V x Ame =Bue-Vf =V-(fBue) 


(11.35) 
so that 
fy (Ame x Bmg) d?r = os (fBurez) =0 (11.36) 
since B mg - ds = 0 by assumption. Thus, Eq.(11.33) reduces to 
Ww AÀ 
OERS (11.37) 
K 2m 


indicating that the minimum energy state must have the lowest value of A consistent with 
the prescribed boundary conditions. 
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11.4 Kinking and magnetic helicity 


Magnetic helicity can be manifested in various forms and plasma dynamics cause this form 
to be altered. We now discuss an important example of helicity-conserving, morphology- 
altering dynamics, namely the situation where kink instability causes the twist of a straight- 
axis flux tube to be transformed into the writhe of the axis of an untwisted flux tube (Berger 
and Field 1984, Moffatt and Ricca 1992). 

Consider a volume V bounded by a surface S where no magnetic field lines penetrate 
the surface S' so that B-ds = 0 over all of S. It is assumed here that V extends to infinity, 
but the analysis would equally apply for a finite-extent volume V so long as no magnetic 
field lines penetrate its bounding surface S. The helicity contained in V is 


Ky = | A- Bar. (11.38) 
v 


Equation (10.129) showed that Ky is gauge-independent because no field lines penetrate 
S. 

The volume V is now decomposed into two subvolumes which have B-ds = 0 at all 
locations on their mutual interface. Thus, any specific field line in V is entirely in one or 
the other of the two subvolumes. 

1. The first subvolume, called Vube, is a closed flux tube of minor radius a with a pos- 
sibly helical axis as shown in Fig. 11.4. The length of the axis of this flux tube is 
denoted lazis and the variable € denotes the distance along this axis from some fixed 
reference point x9 on the axis. Incrementing € from 0 to lazis thus corresponds to 
going once around the flux tube axis. A pseudo-angular coordinate ¢ is defined as 


3 


T 


ġ= 2r (11.39) 
so that going once around the axis corresponds to incrementing ¢ from 0 to 27 and Q 
defines the local direction of the axis. Because the axis of the flux tube has the complex 
curvature of a helix, everywhere along the axis there exists a radius of curvature vector 
Kk = ġo- Vo which is at right angles to ¢. The local radius of curvature of the axis 


` apd 
iS. Fourie = lo - vo (see discussion of Eq.(3.85)). From the point of view of 


an observer inside the flux tube, the flux tube appears as a long curved tunnel that 
eventually closes upon itself. The flux tube minor radius a must always be smaller 
than recurve Which corresponds locally to the condition that by definition the major 
radius of a torus must exceed the minor radius. The flux tube interior volume is now 
imagined to be filled with fiduciary lines all of which are parallel to the flux tube axis; 
the lengths of these fiduciary lines will vary according to their location relative to the 
axis. We define £’ as the distance along a fiduciary line of length l’ from a plane 
perpendicular to the flux tube axis at the reference point x9. The meaning of ¢ can 
then be extended to indicate the distance along any fiduciary line using the relation 
E' = ¢l'/27-. Incrementing ¢ from 0 to 27 thus corresponds to going once around any 
or all of these fiduciary lines and so ¢ provides an unambiguous measure of fractional 
distance along the flux tube for any point within the flux tube even though the flux 
tube may be curved, twisted, or helical. 
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helical flux tube, 
volume Vube 


axis of helix 


linked external flux Y ext 


Figure 11.4: Subvolume #1 is a flux tube with a possibly helical axis and volume Vixbe, 
Subvolume #2 is all space external to subvolume #1. All field lines in subvolume #2 which 
link subvolume #1 are represented by the vertical flux tube in figure labeled w,,.,. 


2. The second subvolume in this decomposition is the remaining volume of V and is 
labeled as V.,;. There might be field lines in this second subvolume which link the 
flux tube Vube and the flux due to all these linkages is labeled 7,,,,. This external flux 
linkage is represented by the vertical flux tube labeled Y., in Fig. 11.4. 

If the dynamics within the volume V are governed by ideal MHD, then the helicity Ky 
in V must be conserved and, since Ky is a volume integral, the volume of integration can 
be subdivided into the flux tube volume and the volume external to the flux tube, 


Kv = Kyuve vig 


e 
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11.4.1 Ky, 


helicity content of the flux tube 


ube? 


We now consider the helicity content of the possibly helical flux tube, 


Kv, = A-Bd?r. (11.40) 


Viube 


The magnetic field in the flux tube is decomposed into a component Bazis parallel to the 
flux tube axis and an orthogonal component Bazimuthal Which goes the short way around 
the axis; thus the magnetic field inside the flux tube is 


B = Bazis + Bazimuthal- (11.41) 


The field lines in the flux tube are assumed to lie in successive layers (magnetic surfaces) 
wrapped around the flux tube axis. The axial and azimuthal magnetic fields are derived 
from respective vector potentials Aaris and Agzimuthal Such that 


Baris = Vx Aazis 
Bazimuthal = Vx Aazimuthal- (11.42) 


It should be emphasized that these definitions say nothing about the direction of Aaxis or 
Aazimuthal and so, unlike axisymmetric situations, here neither Agzis nor Agzimuthal 
should be construed to be in a particular direction because of the possibility that the flux 
tube is helical. All that can be said is that the curl of Agi; gives the flux tube axial 
magnetic field and the curl of Agzimuthal gives the azimuthal field. 

The helicity content of the flux tube is thus 


tae (Aazis + Aazimuthal) j (Bazis F Bazimuthal) d?r d 1.43) 
Vtube 
and this is true even though the axis of the flux tube could be helical. 


By definition, each layer of field lines constituting a magnetic surface encloses a flux 
®. An equivalent definition is to state that ® is the magnetic flux linked by a contour in the 
magnetic surface going the short way around the flux tube axis. The magnetic surface is la- 
beled by $ and so ® can be considered to be a coordinate having its gradient always normal 
to the flux surface. In effect ® is a re-scaled minor radius, since ® increases monotonically 
with minor radius. 

These definitions are sufficiently general to allow for the axis of the flux tube to be a 
helix, a knot, or a simple closed curve lying in a plane. If the axis is just a simple closed 
curve lying in a plane, then V x Q is normal to the plane and therefore normal to Q. In the 
slightly more general case where the axis is not in a plane but Q- Vx b= O, the path traced 
out by ¢ can be considered to be the perimeter of a some bumpy surface. However, in the 
most general case where o-V x œ is finite, the path traced out by ¢ is not the > perimeter of 
even a bumpy surface (Barnes 1977). There might also be situations where o-Vxo #0 
but the sign of Q- V x Q alternates. If the average of Q- V x @ over the length of the axis 
is zero, then the situation would be similar to the case where Q- Vx b= = 0 everywhere, 
because twists in the axis could be squeezed together axially until mutually cancelling out. 
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Figure 11.5: Ribbon surface extending from helical axis of helical flux tube to surface of 
flux tube with normal to ¢ x & 


Because any magnetic field line in the flux tube lies in some magnetic surface labeled 
by 9, any field line in the flux tube has a component parallel to the axis and possibly also 
a component perpendicular to the axis, but never a component in the V® direction. This 
suggests introduction of an azimuthal coordinate 0 which is defined as the angular distance 
around the axis in the direction given by Vo x V®, i.e., 0 is defined such that 


VO Vox Ve 
[V6] [Vox VO] 


The local direction of VO depends on both ¢ and 0; furthermore, for 0 to denote some 
definite position, 0 must be measured with respect to some unambiguous origin. An unam- 
biguous 0 origin can be defined by using as a reference the negative of the direction of the 
local radius of curvature vector K of the flux tube axis. The 0 = O surface is then a ribbon 
surface, as shown in Fig.11.5, that extends from the flux tube axis to the flux tube outer sur- 
face so that Q x K is normal to the ribbon surface. The ribbon is considered to have a cut 
at @ = Oso that the ribbon ends at ¢ = O and ¢ = 27 are distinct. 


(11.44) 
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Figure 11.6 shows a cross-section of this system. The flux tube has minor radius a and 
the helical axis of the flux tube traces out a trajectory with minor radius b about an axis 
which has major radius R. Representative magnetic surfaces (surfaces of constant ®), the 
ribbon surface, and the angle 0 are also shown. 


flux tube 
surface 


ribbon 
surface 


axis of helix 


axis of flux tube 


magnetic surface, 


Ea constant ® 
+— 2) — 


Figure 11.6: Cross-section showing flux tube of minor radius a, with its axis making a helix 
of radius b. Interior flux surfaces with © = const. and the ribbon surface defining 0 = 0 
are also shown. 


The flux ® was a generalization of the toroidal flux of an axisymmetric system. We 
now define a corresponding generalization of the poloidal flux 7)(®) as the magnetic flux 
penetrating a sub-ribbon extending inwards from the outer surface of the possibly helical 
flux tube to some given interior magnetic surface ®. This subribbon is shown in Fig.11.7 
and the definition is such that 


Y = i] ds-B 
subribbon 


f ds-VxA 
subribbon 


= faa (11.45) 
C 


where the contour C follows the perimeter of the subribbon and specifically follows the 
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two ends at 6 = 0 and ¢ = 27. This definition implies that 7 = O on the outer surface of 
the flux tube because the sub-ribbon area will be zero in this case. 


We define the vector potential 


WT) Vo (11.46) 
2T 


Aazimuthal = 


and note that this definition is valid only in the range O < ¢ < 27. The appropriateness of 
this definition is established by integrating following the contour C' around the subribbon 
to obtain 


$ dl- Aazimuthal = f d- yg 
c c 2T 


Il 
= 


(11.47) 


where dl-V@ = dø. As seen in Fig.11.7, the contour C consists of four segments, namely a 
segment following the outside edge of the sub-ribbon (this segment lies in the outer surface 
of the flux tube where ~ = 0), the segment labeled Ce on the inside edge of the sub-ribbon, 
a segment at the ¢ = 0 end, and an oppositely directed segment at the ¢ = 27_ end. Only 
the Cg segment contributes to the integral because (i) dl-Vé = 0 on the ends ¢ = 0 and 
h = 27_ since ¢ is constant on both these ends and (ii) 7 = 0 on the outside edge of the 
sub-ribbon. 


The magnetic field inside the flux tube can thus be written as 


(© 
B = Basis + V x Ogg (11.48) 


where Bazis is the magnetic field component parallel to the flux tube axis. This decom- 
position of the magnetic field is valid even for a twisted or knotted axis provided it is used 
only for situations where 0 < @ < 2r. 
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Figure 11.7: 4 is flux through cut subribbon located between magnetic surface ® and outer 
surface of flux tube. 


The vector potential for the entire field can be written as 


b(D 
itech WE a, (11.49) 
2T 
These results can be used in Eq.(11.43) to give the helicity content of the flux tube as 
b(D 1 
K Vous = ji d?r (Anni + ve) $ (Bazi + — Vy x ve) 
Veuve 20 2T 


(© 1 
I d?r (Anni . Bazis + WP) og Bazis + Aazis Ë = VW x ve) 
Veuve 21 2T 
(11.50) 


and this may be decomposed into writhe and twist terms, 


KV,uve = Kurithe + Ktwist 


respectively defined as 


Kwrithe = J d?rAazis - Bazis (11.51) 
Vi 


ube 
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and 


p(® 1 
Kiwis = f d?r (Pv . Bazis + Aazis ý 57 Vy x ve) = (11.52) 
Voie 2T 2T 


As will be shown below, Ktwist depends on ~ being finite whereas Kwrithe depends on the 
extent to which the flux tube axis is helical. The flux tube helicity can thus be entirely due 
to Kwrithe, entirely due to Kiwist, or due to some combination of these types of helicity. 


11.4.2 Evaluation of Kivist 


Since the Ktwist integral is a volume integral, the flux tube may be cut at ¢ = 0 without 
affecting this integral. Making such a cut means that ¢ is restricted to the range 0 < ¢ < 
27 and therefore does not make a complete circuit around the axis of the flux tube. The 
evaluation of this integral is insensitive to the connectivity of the axis because connectivity 
is a concept which makes sense only after making a complete circuit of the axis. The axial 
magnetic field within the cut volume may be expressed as 


Basis = ty x OVE (11.53) 
20 


where 0 is the angular distance on a contour Cg encircling the flux tube axis and lies in a 
magnetic surface. This representation for the axial magnetic field is appropriate here since 
at each ¢ for O < ġ < 27 we may write 


ẹ = Jes -Bazis 


+ ds-V x PV8 
2T 
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11.54 


By uncurling Eq.(11.53) it is seen that vector potential associated with the axial magnetic 
field may be represented in the cut flux tube as 


Agee Žv, O< g< 2r (11.55) 
so that 
Kivist = <5 | NE d?r (YVo- VE x VO +8V0 -Vy x Vo) 
= = N d?r (YVp- VE x VO+ by'VO- Veh x Ve) 
1 3 i nI 
=i d?r (— + Oy’) VO- VE x Vo. (11.56) 


Viube 
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The three direction gradients V0, V®, and Vo form an orthonormal coordinate system 
and an element of volume in this system is given by 


dbdbd@ 
V-V x Vo 
since d0 = dig |V6|, dọ = dig |V¢| , and d® = dig |V®|. Thus, 


ie a: oe (27) _ 
Kiwist = Ws d® dé do (—4 + by) 
4T? Jo 6=0 ¢=0 
® 


® 
= -f vao+ f y' bdb 
0 0 


= 2 [eae (11.58) 
7 o db 


d?r = dlgdigdly = (11.57) 


where the integration limit (27) — corresponds to being infinitesimally less than 27. The last 
line has been obtained using the relationship f d(u) = [vo]s"* = f day + f vad 
and noting that the integrated term vanishes since 7 = 0 in the flux tube surface and 6 = 0 
on the flux tube axis. 


11.4.3 Evaluation of K writne 


Connectivity of the flux tube is now the important issue. In order to evaluate the Kwrithe 
integral the volume element is expressed as 


d?r = dl-ds (11.59) 


where dl is an increment of length along the axis and ds is an element of surface in the plane 
perpendicular to the axis so that Bazxis'ds = d®. Because the line integral will involve a 
complete circuit of the flux tube axis, in contrast to the earlier evaluation of Ktwist, we now 
avoid using the gradient of a scalar to denote distance along the axis. Using Eq.(11.59) the 
writhe helicity may therefore be expressed as 


Kwrithe = f Aaris > dlocis feo 
Caxis 


= o f Aaris ` dlazis- ad 1.60) 
Cagis 

This integral differs topologically from the integrals of the previous section, because here 
the contour is a complete circuit, i.e., ọ varies from 0 to 27, and not from 0 to 2r_. 

A contour path C of an integral f c A-dl may be continuously deformed into a new 
contour path C’ without changing the value of the integral provided no magnetic flux is 
linked by the surface S bounded by C and C”; this behavior is a three dimensional analog 
to the concept of analyticity for a contour integral in the complex plane. The validity of 
this assertion is established by the relation 


o= | Bas=fAa-ai= f A-a- A-dl. (11.61) 
s C c! 
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It should be noted that the ability for C to be continuously deformed into C” imposes the 
requirement that a surface S exists between C' and C”; this is only true if C does not link 
C". 

We now consider some limiting cases for Kwrithe and to aid in visualization it should 
be recalled that the flux tube axis is assumed to have a helical trajectory with helix major 
radius œR and helix minor radius b as shown in Fig.11.6. 

Limiting case where flux tube axis not helical 

This situation corresponds to having b = O in which case the flux tube axis lies in 
a plane. The contour Cais in Eq.(11.60) can thus be slipped from its original position 
through the Bazi; field lines to the surface of the flux tube without crossing any Bagis 
field lines. This is topologically possible because by definition, all Basis field lines in the 
flux tube are parallel to the flux tube axis. Thus, here 


K writhe = o f Aazis ` dlazis (11.62) 
Cr 


where C” lies on the flux tube surface. The line integral C” encircles the external flux linked 
by the flux tube and so 


K writhe = PWext, When the flux tube axis is not helical (11.63) 


helical axis 
of flux tube 


ZN 


R 


CES 


Eee 
a <S 


slit cut in flux tube 
parallel to flux tube 
axis, faces helical axis 


Figure 11.8: Helical axis of flux tube may be slipped through Bazis field lines of flux tubes 
to coincide of axis of helix (note that open slit in flux tube always faces axis of helix). 
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Case where axis is helical and b > a 

Because the helix minor radius b exceeds the flux tube minor radius a, the entire flux 
tube revolves around the axis of the helix as shown in Fig.11.8. Because the helical axis 
closes upon itself, it must have an integral number of periods and we let N be the number 
of helix periods. However, because Bagis is everywhere parallel to the flux tube axis, the 
flux tube axis may again be slipped through the B,.;, field lines and in this case moved 
towards the helix axis until it coincides with the helix axis C).);2. The writhe helicity can 
thus be expressed as 


Kurithe = of Aaris > dlozis- (11.64) 
Ch 


eliz 


However, the flux tube links the helix axis N times and so 


| Aazis ` dlazis = NO + UTA (1 1.65) 
Chelie 


Kurithe = ND? + BY,,, if b> a. (11.66) 


Case where flux tube axis is helical but a > b 

In this situation the flux tube may be subdivided into an inner core with minor radius 
r <b and associated flux Pb?/a? and an outer annular region with b < r < a with 
the remainder of ©. We again slip the flux tube axis through the parallel Bazis field lines 
to be coincident with the helix axis. However, now only the inner core of the flux tube 
(r < bazis) rotates around the helix axis. The outer annular region of the flux tube merely 
wobbles around the helix axis and does not link the helix axis. Thus, the linked flux is 


f Aazis ` dlazis = NDD? Ja? + Went (11.67) 
Chelie 


and so the writhe helicity is 
Kwrithe = N®767/a? + Dyert if a> b. (11.68) 
Hence, the helicity of the flux tube is 
Ky, = Kiwist + Kurithe 


ube 


dy , 
= 2 opa + NGO? /a? + Dye if a> b. (11.69) 


The distinction between the a < b and a > b cases is sketched in Fig.11.9. 
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Figure 11.9: Top: case where flux tube radius a is less than radius b of helix traced out by 
flux tube axis. Bottom: Case where a > b; the core region 0 < r < b rotates around the 
axis of the helix whereas the outer region with b < r < a wobbles about the axis of the 
helix. 


11.4.4 Twist and writhe exchange in a kink instability 


A kink instability is governed by ideal MHD and so must be helicity-conserving. The 
number of turns of the kink is defined by the initial twist condition k - B =kgBg+koBo = 
0 so if ke = 1/a and kg = n/R then the kink will result in a helix with N = n. Since 
helicity is conserved, it is seen that as the amplitude of the kink increases (i.e., b increases), 
the flux tube twist dy /d® will have to decrease. In particular if the twist is uniform so that 
Y% = T® where twist T is independent of ®, then as in Section 11.2.2 we may write 


du 
2 opad = To? (11.70) 
do 


and so from conservation of helicity we may conclude that 


T + Nb? /a? = const. for b < a; (11.71) 


this is an example of the Calugareanu theorem (Calugareanu 1959). 
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A kink instability which starts with 6 = O and an initial twist Tinitia = n will grow 


until at b = a the twist vanishes and all helicity is contained by an N-turn writhe where 
N = Tinitiat- Thus, a kink will convert twist into writhe so that for b > a, T = 0 and 
N = Tynitiat- This property can be confirmed by taking a length of garden hose with a 
stripe running along the length and connecting the two ends together to form a closed flux 
tube with a stripe running along the axis. Initially, N + T = O because the stripe is not 
twisted and the hose axis is not helical. If the hose is deformed into a right-handed helix, 
then the stripe will make a left-handed helix about the hose axis to keep N +T = 0. If the 
stripe is initially a right-handed helix when the hose is not a helix, then deformation of the 
hose into a right handed helix will result in the stripe becoming parallel to the hose axis. 


1. 


11.5 Assignments 


Derive the helicity conservation equation for a resistive plasma 


dk 

Tag ds- (yB +Ex A) =-2 | drnI-B (11.72) 
s 

and express it in the form 


OA 
— + | ds- (208 + Ax )= 2 f @rnI-B (11.73) 
S 


ot 


where g is the electrostatic potential. What happens to the surface integral when the 
surface is a perfectly conducting wall? Hint: Start by evaluating —3 (A - B) /0t and 
make repeated use of Faraday’s law V x E = —OB/0t, the electric field expressed as 
E = —Vy — 0B/0t and the resistive MHD Ohm’s law E+ U x B =nJ. 


2. Alternative explanation for why helicity is conserved better than magnetic energy: 


(a) By subtracting E dotted with Ampere’s law from B dotted with Faraday’s law 
derive the MHD limit of Poynting’s theorem: 


2 
Z (F ) pv (23) - E-J (11.74) 
Ot \ 20 Ho 


(b) Consider a plasma in a perfectly conducting chamber with no vacuum gap be- 
tween the plasma and the chamber wall. Show that integration of Eq.(11.72) and 
Eq.(11.74) over the entire volume give respectively 


dK 
— = —2 | drnJ-B 11.7 
J J rn (11.75) 
and dw 
| = | rI. 11.7 
at J rn (11.76) 


where W = f d?r B? /2uo is the magnetic energy. 


(c) Show that the right hand side of Eq. (11.76) is proportional to higher order 
spatial derivatives of B than is the right hand side of Eq. (11.75). Use this 
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property to argue that the rate of dissipation of magnetic energy greatly exceeds 
that of magnetic helicity when the dynamics is spatially complex so that most of 
the spectral power is in short characteristic scale lengths. 


(d 


wm 


Explain why this difference in dissipation rates could be approximated by as- 
suming that magnetic helicity remains constant while magnetic energy decays 
in the presence of dynamics having fine scales. Argue that the decay of mag- 
netic energy is thereby constrained by the requirement that helicity is conserved. 


Show that the minimum energy state given by Eq.(11.32) is a force-free configuration. 
Is \ spatially uniform? Is this the same result as given in Eqs.(10.139) and (10.140)? 
Take the curl again to obtain 


V?B+\°B=0. 
What are the components of this equation in axisymmetric cylindrical geometry? Be 
careful when evaluating the components of V?B to take into account derivatives op- 
erating on unit vectors (e.g., V? (B69) Æ V?’ By). 


(a) Show that for axisymmetric cylindrical geometry the minimum-energy states 
have the magnetic field components 


B-(r) = BJo(Ar) 


Bo(r) = BJ (àr). 
Sketch B,(r) and By(r). This is called the Bessel function model or Lundquist 
solution (Lundquist 1950) to the force-free equation and is often a good first 
guess representation for nearly force-free equilibria such as spheromaks, re- 
versed field pinches, and solar coronal loops. 


. Foracylindrical system with coordinates {r, œ, z} show that if x satisfies the Helmholtz 


equation 
y? X+ rx =0 

then 

B=AVx x Vz + V x (Vx x Vz) 
is a solution of the force-free equation uoJ =AB. By assuming that x is independent 
of ¢ and is of the form x ~ f(r) cos kz find f (r) and then determine the components 
of B. Show that if satisfies an eigenvalue condition, it is possible to have a finite 
force-free field having no normal component on the walls of a cylinder of length h 
and radius a. Give the magnetic field components for this situation. Calculate the 
poloidal flux y(r, z) by direct integration of B,(r,z). Hint: the answers will be in 
terms of Bessel functions Jo and J4. 


. Obtain a ribbon such as is used in gift-wrapping, make one complete twist in this 


ribbon and tape the ends together. By manipulating the twisted ribbon show the fol- 
lowing: 
(a) If the ribbon is manipulated to have no twists, then it has a figure-eight pattern 
with a cross-over. If the flux through the ribbon is ®, show that the helicity of 
one full twist is 7, show that the helicity of one cross-over is also . 
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(b) Cut the ribbon all the way along its length so that it becomes two interlinked 


(c) 


ribbons of unequal width. Assuming that the original ribbon had a flux ®, calcu- 
late the helicity of the configuration taking into account the helicity due to both 
twists and linkages and show that helicity is conserved. 


Demonstrate that the sum of twist plus writhe is conserved using a bundle of 
flexible wires (or ropes) that are initially twisted about a straight axis and then 
manipulated into a kink where the wires are parallel to the locus of the kink. The 
ends of the bundle should be fixed. Similarly, start with a bundle of untwisted 
wires that are parallel to a straight axis and manipulate the wires into a helix. 
Show that the sense of the twist of the wires about their local axis is opposite to 
the sense of the helix so that a condition of zero helicity is maintained. 


12 


Magnetic reconnection 


12.1 Introduction 


Section 2.5.4 established the fundamental concept underlying ideal MHD, namely that 
magnetic flux is frozen into the plasma. Chapter 10 showed that ideal MHD plasmas 
are susceptible to two distinct types of instabilities, pressure-driven and current-driven. 
Pressure-driven modes draw on free energy associated with heavy fluids stacked on top 
of light fluids in an effective gravitational field whereas current-driven instabilities draw 
on free magnetic energy and involve the plasma attempting to increase its inductance in a 
flux-conserving manner. Both of these instabilities occur on the Alfvén time scale defined 
as some characteristic distance divided by v4. 

It is possible for an MHD equilibrium to be stable to all ideal MHD modes and yet not 
be in a lowest energy state. Free energy is therefore available to drive an instability but this 
energy cannot be tapped by ideal MHD modes. Magnetic tearing and associated reconnec- 
tion is a non-ideal instability where the plasma is effectively ideal everywhere except at a 
very thin boundary layer where the ideal assumption breaks down and magnetic fields can 
diffuse across the plasma. Even though the boundary layer is thin, the tearing/reconnection 
causes the overall topology of the magnetic field to change. It is the changing of the topol- 
ogy (the local unfreezing of the magnetic field from the plasma) that allows the configura- 
tion to relax to a lower energy state. Because ideal MHD does not allow the topology to 
change, the lower energy state cannot be accessed via ideal MHD instabilities. The non- 
ideal resistive modes are much slower than ideal modes because the non-ideal mode relies 
on a diffusion or leakage of magnetic field across the plasma in some very limited spatial 
region where ideal MHD breaks down. 

Magnetic tearing is an intrinsically complicated boundary layer phenomenon because 
two very different length scales mutually interact; the mathematical description has to rec- 
oncile behavior in a microscopic thin diffusion layer with behavior in a macroscopic outer 
ideal layer. We shall begin the discussion with an analogy drawn from everyday experience 
and use this analogy to show why it is energetically favorable for a sheet current to break 
up into filaments. The time scale for this process will be estimated and shown to be much 
slower than ideal MHD. Finally, the tearing and reconnection concept will be generalized 
to sheared magnetic fields and helical equilibria. 
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(a) (b) (c) 


contraction to beading 
substrate long single large drop (energetically 
thin of same area favorable) 
drop (energetically 
unfavorable) 


Figure 12.1: (a) Initial long thin drop, (b) contraction into big round drop (too much scrap- 
ing on substrate), (c) beading into chain of little drops. 


12.2 Water-beading: an analogy to magnetic tearing and 
reconnection 


Since magnetic tearing and reconnection is not a simple process, it is helpful to start by 
considering the dynamics of a somewhat analogous instability known from everyday ex- 
perience, namely the process of water beading. The initial condition in water beading is 
shown in Fig.12.1(a) and consists of a long, thin, two-dimensional, incompressible drop of 
water (approximate analog to the magnetic field) frictionally attached to a substrate (ap- 
proximate analog to the plasma). The long thin drop has surface tension ‘trying’ to reduce 
the perimeter of the drop (this is analogous to the pinch force being interpreted as field 
line “tension” squeezing on a current channel). If the drop were not attached to the sub- 
strate, then as shown in Fig.12.1(b), the surface tension would simply collapse the long thin 
incompressible drop into a circular drop having area equal to that of the initial long thin 
drop. Because of the frictional work involved in dragging the water over the substrate, this 
wholesale collapse is not energetically favorable and does not occur. On the other hand, 
if as shown in Fig.12.1(c) the long thin drop breaks up into a line of discrete segments 
(which does not involve significant dragging of water across the substrate), the surface ten- 
sion of each line segment causes each discrete segment to contract in length and bulge in 
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width until circular. Because only modest frictional dragging of water across the substrate 
is required to do this, the process can be energetically favorable, and so result in water 
beading. 


12.3 Qualitative description of sheet current instability 


Suppose an infinite-extent plasma has a thin sheet of current flowing in the z direction 
as shown in Fig.12.2(a). The sheet current is centered at x = O and extends from y = 
—co to y = +00. The current is uniform in both the y and the z directions and roughly 
corresponds to the initial long thin water drop. This somewhat artificial situation can be 
considered as the Cartesian analog of a cylindrical shell current flowing in the z direction, 
localized at some radius r = ro and azimuthally symmetric. Thus, we identify r with x 
and y with 0. 


(a) (b) (c) 
f) fluid 
{ te: flow 
f) vector 
gap ——> 
k I | 
i ae ae center of 
: typical 
typical bar—_, 3 
self field fluid 
vortex 
i) | 
Í filamentation detail of bars 
of current showing fluid 
sheet into bar flow patterns 
current 


cross-sections 


Figure 12.2: (a) Initial sheet current; (b) breaking up into bar cross-section filaments; (c) 
detail showing fluid flow consists of set of small vortices which are antisymmetric in x. 


Since all quantities depend on x only, Ampere’s law reduces to 


ðB 
Hiode(t) = = (12.1) 
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which integrates to give 


B, (x) = B, (0) +f Lodz (x')da’. (12.2) 
0 


On the basis of symmetry the sheet current cannot generate a magnetic field at x = 0 so the 
field B,(0) must entirely due to external currents. Let us assume for now that no external 
currents exist and set B,(0) = 0; the situation of finite B,(O) will be treated later in 
Sec.12.5. Thus B} (x) is a sheared magnetic field which is positive for x > 0 and negative 
for x < 0. The magnitude of B,(x) changes rapidly inside the current layer and becomes 
constant for |x| — oo. 

This situation can be characterized analytically by using a magnetic field 


B,(a) = B tanh(2/L) (12.3) 


where L is the scale width of the current layer. Substituting in Eq.(12.1) gives 


B -2 
J- (x) m cosh ~*(x/L) (12.4) 
which is sharply peaked in the neighborhood of x = 0. 

Suppose that the perturbation shown in Fig.12.2(b) is introduced so that the current 
sheet cross-section is broken up into a number of bar-shaped structures each with current 
Ivar Separated by small gaps in the y direction; these bars are roughly analogous to the wa- 
ter beads discussed above. Each bar’s self magnetic field acts like an elastic band wrapped 
around the bar (analogous to the surface tension of a drop). This tension contracts the y 
dimension of the bar, and if the bar is incompressible, the x dimension will then have to 
grow, as shown in Fig.12.2(c). As the bar deforms from a rectangle into a circle having 
the same area, its perimeter becomes shorter giving rise to a stronger field (and effective 
surface tension) since 


f B-dl =o Ibar- (12.5) 
perimeter 


Hence, this deformation feeds upon itself and is unstable. Note that the inductance of the 
system increases as the current breaks into filaments, consistent with the earlier observation 
that the plasma is always ‘trying’ to increase its inductance. 

This energetically allowed filamentation instability is forbidden in ideal MHD because 
the topology of the magnetic field changes at the gaps between the bars. Before the gaps 
form, the magnetic field lines are open and straight, stretching from y = —oo to y = +00 
whereas after the gaps form, some field lines circle the current filaments; these field lines 
are finite in length and curved. In order to go from the initial state to the filamented state, 
some magnetic field lines must move across plasma in violation of ideal MHD (this change 
in topology is approximately analogous to dragging the water drops across the substrate in 
the water beading problem). The field lines have an ‘X’ shape at the center of the gaps and 
the change in topology occurs at these X-points. The center of the bar is called an O-point 
since the field around this point has an O-shape. 

If some mechanism exists that allows X-points to develop, instability can occur. Since 
finite resistivity allows magnetic field to diffuse across plasma, a resistive plasma will be 
susceptible to this instability at locations where the magnetic field attachment to the plasma 
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is weakest. The location of the weak spot can be deduced by examining Ohm’s law for a 
nearly ideal plasma (i.e., small but finite resistivity), 


E+UxB=nJ. (12.6) 


At most locations in the plasma, the two left hand side terms are both much larger than the 
right hand side resistive term and so plasma behavior is determined by these two left hand 
terms balancing each other. This balancing causes the electric field in the plasma frame to 
be zero, which corresponds to the magnetic field being frozen into the plasma. However, 
if there exists a point, line or plane of symmetry where either U or B vanishes, then in the 
vicinity of this special region, Eq.(12.6) reduces to 


zr. (12.7) 
The curl of Eq.(12.7) gives 
OB 
°? = He (12.8) 
Ot Mo 


a diffusion equation for the magnetic field. Thus, in this special region the magnetic field 
is not frozen to the plasma and diffuses across the plasma. 

Let us now examine the fluid flow pattern associated with the bars as they contract in 
the y direction and expand in the x direction. As shown in Fig. 12.2(c), each bar has 
y-directed velocities pointing from the gap toward the bar center and x-directed velocities 
pointing out from the center of the bar. To complete the incompressible flow there must 
also be oppositely directed x and y velocities just outside the bar with the net result that 
there is a set of small fluid vortices which are anti-symmetric in x and in addition have 
a y-dependence which is 90° out of phase with respect to the current bar y—direction 
periodicity. In particular, there is an outward x-directed velocity at the y location of the 
O-points and an inward x-directed at the y location of the X-points. The fluid motion in 
summary consists of a spatially periodic set of vortices that are antisymmetric with respect 
to x = 0. Each bar has a pair of opposite vortices for positive x and a mirror image pair of 
vortices for negative x so that there are four vortices for each bar. 


12.4 Semi-quantitative estimate of the tearing process 


An exact, self-consistent description of tearing and reconnection is beyond the capability 
of standard analytic methods because of the multi-scale nature of this process. However, 
the essential features (geometry, critical parameters, growth rate) and a reasonable physical 
understanding can be deduced using a semi-quantitative analysis which outlines the basic 
physics and determines the relevant orders of magnitude. The starting point for this analysis 
involves solving for the vector potential associated with the magnetic field in Eq.(12.3), 


obtaining 
T 


A,(x) = -f B,(x')dz' = —BL In [cosh(x/L)] + const. (12.9) 


The constant is chosen to give A, = 0 at x = 0 so 


A,(x) = —BL In [cosh(x/L)}. (12.10) 
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For |x| << L, cosh(x/L) ~ 1+27/L? while for |x| >> L, cosh(a/L) ~ exp(|2|/L)/2. 
Thus, the limiting forms of the vector potential are 


Bu? 
li A, = —— 12.11 
and 


lim A,(x) = -BL (|x| —In2). (12.12) 
jz|>>L 


Near x = 0, A, is parabolic with a maximum value of zero, while far from x = 0, A, is 
linear and becomes more negative with increasing displacement from x = 0. This behav- 
ior of the vector potential is consistent with the field being uniform far from x = 0, but 
reversing sign on going across x = 0. The behavior is also consistent with the relationship 
between the current density and the second derivative of the vector potential, 


foe = OBy = _@A, 
Oe ax Ox? ` 
Thus, current density is associated with curvature in A, (x) or, in more extreme form, with 
a discontinuity in the first derivative of A, (x). Specifying the vector potential is sufficient 
to characterize the problem since the magnetic field and currents are respectively the first 
and second derivatives of A, (x). This general idea can be extended to more complicated 
geometries if there is sufficient symmetry so that specification of an equilibrium flux profile 
uniquely gives both the equilibrium field and the current distribution. 
The reconnection process is characterized by the MHD equation of motion 


(12.13) 


dU 
arg =JxB-VP, (12.14) 
Faraday’s law expressed as 
OA 
E =- — : 
aE? (12.15) 
Ampere’s law 
V x B = wJ, (12.16) 
and the resistive Ohm’s law 
E + U x B =J. (12.17) 


The analysis involves relating the velocity vortices to the linearized Ohm’s law, and in 
particular to its z component 


Faz + Uiz Boy = nz. (12.18) 


The sense of the vortices sketched in Fig.12.2(c) indicate that the velocity perturbation 
is uniform in the z direction and Uj, is antisymmetric with respect to x. Also, since the 
motion consists of vortices, there is no net divergence of the fluid velocity and so it is 
reasonable and appropriate to stipulate that the flow is incompressible with V - U = 0. 
Since the perturbed current density is in the z direction, and since for straight geometries 
the vector potential is parallel to the current density, the perturbed vector potential may 
also be assumed to be in the z direction. Hence, both equilibrium and perturbed vector 
potentials are in the z direction and so the total magnetic field is related to the total vector 
potential by 

B=V x A,Z= VA, x 2. (12.19) 
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Equation (12.18) can be recast using Eqs.(12.15) and (12.19) as an induction equation, 


aA, AAoz 
Uiz 
at 12 Or 


= NJ iz. (12.20) 
The current has the form 


jigs = 2- V x B= 2- V x (V x Ag?) = 2- V x (VA; x 2) = =V7 A, (12.21) 
where the subscript -L means perpendicular to z. Thus, the induction equation becomes 


Aiz Aoz 
H Uir 
ôt ©" Aa 


To proceed, it is necessary to express the perturbed velocity Uiz in terms of Az; this 
relation is obtained from the equation of motion. 

While we could just plow ahead and manipulate the equation of motion to obtain Uiz 
in terms of A,,, it is more efficient to exploit the incompressibility relation. In two- 
dimensional hydrodynamics, incompressibility simplifies flow dynamics so that flow is 
described by two related scalars, the stream-function f and the vorticity Q. For two- 
dimensional motion in the x — y plane of interest here, the general incompressible velocity 
can be expressed as 


Ae. (12.22) 
Ho 


U=Vfx2 (12.23) 
since 
V-U=V.(Vfx2)=2:-VxVf=0. (12.24) 
The vorticity is the curl of the velocity and because the velocity lies in the x — y plane, the 
vorticity vector is in the z direction. The vorticity magnitude 2 is given by 


Q=2-VxU=2.Vx(Vfx2)=V:[(Vf x2) x2] =-Vv?} f (12.25) 


where | means perpendicular with respect to z. Given the vorticity, f can be found by 
solving the Poisson-like Eq.(12.25), and then knowing f, the velocity can be evaluated 
using Eq.(12.23). Appropriate boundary conditions must be specified for both f and Q; 
these boundary conditions are that the vorticity is antisymmetric in x and is only large in 
the vicinity of x = 0 as indicated in Fig.12.2(c). 

The curl of the equation of motion provides the vorticity evolution and also annihilates 
V P; this elimination of P from consideration is why the vorticity/stream-line method is a 
more efficient approach rather than direct solution of the equation of motion. 

Let us now solve for Uj, following this procedure. The linearized equation of motion 
is 


OU- 
Po = (J x B), — VP, (12.26) 
taking the curl and dotting with Z gives 
on 
Poa = 2-Vx(JxB), 


= 2-Vx[J,2x (VA, x 4), 
4 V x (JVA) 
- (VJ; x VAz),- (12.27) 


lI 
NY 


| 
X> 
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Using Eq.(12.21) this can be written as 

on, 1 

—+ = — 2. [V4 x V (V3 A)|,- (12.28) 

Ot Hopo ; 
From Fig.12.2(c) it is expected that the vortices have significant amplitude only in the 
vicinity of where the current bars are deforming and that at large |x| there will be negli- 
gible vorticity. Thus, it is assumed that the vorticity evolution equation has the following 
behavior: 
1. Inner (tearing/reconnection) region: Here it is assumed that the perturbation has much 
steeper gradients than the equilibrium so 


IV (V? Azo) | 2 IV (Vi Aa) | 


12.29 
|V Ao] |VAzl] ( ) 
This allows Eq.(12.28) to be approximated as 
On, 1 a 2 
— x Z-|VA.z0 x V (VIA: 
a mam ae 
1 dA.o O 2 
= — (Vi A, (12.30) 
Hopo da Oy ( 1) 


which shows that Jı, crossed with Byo generates vorticity. Since Byo is antisym- 
metric with respect to x, the vortices have the assumed antisymmetry. Furthermore, 
because J;, is symmetric with respect to x and localized in the vicinity of x = 0 the 
vortices are localized to the vicinity of x = 0. 


2. Outer (ideal) region: Here it is assumed that Qı ~ 0, so Eq.(12.28) becomes 


dA O 2 Azı d?Az0 

dx By | L 21) — Oy dr? 
which is a specification for A, in the outer region for a given Azo. Thus, it is effec- 
tively assumed that the outer perturbed field is force-free, i.e., (J x B); = O so that 
no vorticity is generated in the outer region. 

The perturbed quantities will now be assumed to have the space-time dependence 


=0 (12.31) 


Azı = An(x)eiker 


Qui =O) (x)eikvt74 (12.32) 
so that Eq.(12.30) gives the inner region vorticity as 
1 dA, 1 dA. 
lik (V? Aa) = -— 2 ikJa. (12.33) 


"Lge dat Yo dat 
This satisfies all the geometric conditions noted earlier, namely the antisymmetric depen- 
dence on 2, the localization near x = 0 and, consistent with Fig.12.2(c), a periodicity in y 
that is 90° out of phase with the periodicity of J,,. 
Using Eq. (12.23), it is seen that 


OKs ik fy. (12.34) 


Uir = 
1 ay 
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The stream-function f is a solution of the Poisson-like system Eq.(12.25) and Eq.(12.33), 


O fy 2 1 dA 
— k? fi = — ik Jet. 12. 
Ax fi TAT ikJz1 (12.35) 


Since the perturbed current peaks at x = O and has a width of the order of €, it may be 
characterized by the Gaussian profile 


A 2 2 
Ja x oe 12.36 
1 ST ( ) 
where 
a= f Ja adz (12.37) 
layer 


is the total perturbed current in the tearing layer. The gradient of the vector potential can 
be written as 


dA.o T 
age = —Byo(x) = = 0" (12.38) 
Assuming that the tearing layer is very narrow gives 
2f 
f os ee, (12.39) 
Ox? 
so that Eq.(12.35) becomes 
2f B; À ikBig àc d 
f TREE E E A eee (12.40) 
Ox yLpo 6/7 2y Lpo yT dx 


The profiles of J,1, Byo(x) and their product (right hand side of Eq.(12.40)) are shown in 
Fig. 12.3. 
Integrating Eq.(12.40) with respect to x gives 


Of, ikBio Xa -z Je 
= e 
Ox  2yLpo yT 


(12.41) 


which incidentally gives U1y = —Of,/Ox. Since it is desired to find the magnitude of U;, 
in the region x ~ €, a rough ‘order of magnitude’ integration of Eq.(12.41) in this region 
gives 


f Large (x) f (12.42) 
1° a SIGN 2) Tor TNE x 
2L VT Po 
and so 
Bre 
Ua f sign(x) for £ ~ €. (12.43) 
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[a Bx 


product J1,B' x 
ere = vorticity source 


Figure 12.3: Product of symmetric perturbed current with antisymmetric equilibrium field 
results in antisymmetric vorticity source localized near x = 0. 


The amplitude factor À can be expressed using Eq. (12.21) as 


Ae SS V? Aida 
Ho 
layer 
1 ee 
x —-— z dz 
Ho Ox 
layer 
1 OA: OA\, 
l 1 (12.44) 
Ho ox j, Ox )_ 
where the subscripts + mean evaluated at x = +e. For purposes of joining to the outer 
ideal solution, the normalized jump derivative is defined as 
Aiz Aiz 
Neat wee (12.45) 
E A,,(0) ; 
so that 
A! 
àA = —— A,- (0). (12.46) 
. Ho 
The velocity becomes 
k?BA' e 
Ua “Ay, (0)sign(z) (12.47) 


2y L/T Mo Po 


370 Chapter 12. Magnetic reconnection 


and using Eq.(12.36), the current density becomes 


A! 
Jy, ~ —-——= Ay, (0). 12.48 
t ae Aaa) (12.48) 
We now repeat the induction equation, Eq.(12.20), 
Aiz Aoz 
Uir =z 12.49 
aL le Bp NAL ( ) 
and substitute for Uiz, Ji, and assume that 0Ap,/Ox = —Byo ~ —Be/L in the tearing 
layer. This gives 
k? B2A'e3 B nA’ 
ci QyL?./TiUoPyp > EL (12.50) 
#1 #2 #3 


where the terms have been numbered for reference in the following discussion. 

In the ideal plasma limit, terms #1 and #2 balance each other while term #3 is small; 
this gives the frozen-in condition. At exactly x = 0, term #2 vanishes and so terms #1 
and #3 must balance each other, resulting in diffusion of the magnetic field. At the edge of 
the tearing layer, which is the transition from the ideal limit to the diffusive limit all three 
terms are of the same size. Thus, the three terms may be equated; this gives two equations 
which may be solved for y and € with A’ as a parameter. Equating terms #1 and #3 gives 

nA' 


v= (12.51) 
ELT 


while equating terms #2 and #3 gives 


kB!) et 
y= ABBE (12.52) 
2nPo 
where B’ = B/L is the derivative of the equilibrium field at x = 0. Equating these last 


two equations to eliminate y gives the width of the tearing layer to be 


on2p, A! 1/5 

ee ana (12.53) 
po VF (KB") 
Substituting € back into Eq.(12.51) gives 
3/5 ete 

kB 

y = 0.55 (A')*/* A (eee (12.54) 
Ho PoHo 


This result can be put in a more transparent form by defining characteristic times for ideal 
processes and for resistive processes. The characteristic time for ideal processes is the 
Alfvén time 7 4, defined as the time to move the characteristic length L when traveling at 
the Alfvén velocity, i.e., 


L L Pollo ` 


-1_ va _ VB?/Poto _ /(B? (12.55) 
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The Alfvén time is the characteristic time of ideal MHD and is typically a very fast time. 
The characteristic time for resistive processes is the time T p, defined as the time to diffuse 
resistively a distance L, 


-1_ 7) 
Tp = Ta (12.56) 
For nearly ideal plasmas the resistive time scale is very slow. Using these definitions, 
Eq.(12.54) can be written as (Furth, Killeen and Rosenbluth 1963) 


y = 0.55 (A'L)*? (kL)? rR 5ra (12.57) 


All that is needed now is A’ . This jump condition is found from Eq.(12.31) which gives 
the form of A, in the ideal region outside the tearing layer. This can be expressed as 


2 
V2 Aa + [B - a 
which shows that the equilibrium magnetic field acts like a ‘potential’ for the perturbed 
vector potential ‘wavefunction’. If boundary conditions are specified at large |x| for the 
perturbed vector potential, then in general, there will be a discontinuity in the first derivative 
of A,, at x = 0; this discontinuity gives A’. The jump depends on the existence of a 
localized equilibrium current since 


An =O0 (12.58) 


d? Byo dJz0 
dz? "° dr ` 

In general the outer equation must be solved numerically. 

The main result, as given by Eq.(12.57), is that if A’ > 0 an instability develops having 
a growth rate intermediate between the fast Alfvén time scale and the slow resistive time 
scale. Since a nearly ideal plasma is being considered, 7 is extremely small. The width 
of the tearing layer is therefore very narrow, since as shown by Eq.(12.53), this width is 
proportional to 7?/°. 


(12.59) 


12.5 Generalization of tearing to sheared magnetic fields 


The sheet current discussed above can occur in real situations but is a special case of the 
more general situation where the equilibrium magnetic field does not have a null, but in- 
stead is simply sheared. This means that the equilibrium magnetic field is straight, has 
components in both the y and z directions, and has direction that is a varying function of 
x. The sheared situation thus has a uniform magnetic field in the z direction and instead 
of the current being concentrated in a sheet, there is simply a non-uniform B,o(2). In this 
more general situation the equilibrium magnetic field has the form 


Bo = Byo(x)§ + B202. (12.60) 


A nontrivial feature of this situation is that unlike the previously considered sheet current 
equilibrium, here B,o(«) does not vanish at any particular x. Instead, as will be seen later, 
what counts is the vanishing of k - Bo. Equation (12.60) can be used as a slab representa- 
tion of the straight cylindrical geometry equilibrium field 


Bo = Vuiq(r) x Vz + B-02; (12.61) 
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which in turn can be thought of as the straight cylindrical approximation of a toroid with z 
corresponding to the toroidal angle. 
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Figure 12.4: Tilted coordinate system for general sheared field. 


Figure 12.4 shows the magnetic field given by Eq.(12.60) as viewed in the x = 0 plane. 
In the sheet current analysis discussed in the previous section, the perturbed vector potential 
pointed in the z direction and was periodic in the y direction. This corresponded to having 
k- A, = 0 so that the wavevector was orthogonal to the vector potential and both were 
orthogonal to x. Other important properties were that k - B vanished at the reconnection 
layer, the fluid vorticity vector was pointed in the z direction, the perturbed currents and 
perturbed magnetic fields were such that Jı /Jo >> B,/Bo in the reconnection layer, and 
Jı/Jo ~ Bı/Bo in the exterior region. 

These relationships and approximations are generalized here and, in particular, it is 
assumed that all perturbed quantities have functional dependence ~ g(x) exp(ikyy+ikzz+ 
yt). As before, the vorticity equation is the curl of the linearized equation of motion, i.e. 


OQ 
Pa = V x (J,xBo+JoxBz1) (12.62) 
and in the reconnection layer where J, /Jo >> B,/Bo this becomes 
OQ, 
— = Vx (J,xB 
P at x (Ji xBo) 
= Bo-VJ1—Ji-VBo (12.63) 


since V- Bo = O and V - J; = 0. 

An essential feature of the reconnection topology is that the vorticity must be anti- 
symmetric about the reconnection layer, as can be seen from examination of the fluid flow 
vectors in Fig.12.2(c). Since the vorticity is created by the torque (i.e., the curl of the force), 
it is clear that the torque must be antisymmetric about the reconnection layer. As will be 
seen in the next paragraphs, the condition that the torque is antisymmetric does not imply 
that either Bo or J, are antisymmetric, but rather implies some more subtle conditions. 
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Thus, if x = 0 is defined to be the location of the reconnection layer, then Q, must 
be an odd function of x and therefore must vanish at x = 0. Hence the right hand side of 
Eq.(12.63) must vanish at x = O and since there need be no particular functional relation- 
ship between Bo and its gradient, the two terms on the right hand side of Eq.(12.63) must 
separately vanish. Therefore one of the requirements is to choose the origin of the x axis 
such that Bo - VJ; = 0 at x = 0 or, equivalently, 


k-Bo =Oatx=0 (12.64) 


so that 
ky Byo(0) + kz Bzo = 0. (12.65) 

Having k - B vanish at the reconnection layer is physically reasonable, since finite k - B 
implies a periodic bending of the equilibrium field; such a bending absorbs energy and so 
is stabilizing. Having k - B vanish is like letting the instability cleave the system at a weak 
point so that the instability can develop without requiring much free energy. 

The second requirement for the right hand side of Eq.(12.63) to vanish is to have 
Jı -VBo = 0. If the flow is incompressible, then the perturbed magnetic field must 
be orthogonal to the equilibrium magnetic field and since the perturbed current is the curl 
of the perturbed magnetic field, the perturbed current must be parallel to the equilibrium 
magnetic field. Since Bo is assumed straight, Bo - VBo = 0 and since J; is parallel to Bo, 
it is seen that Jı - VBo = (J1/Bo) Bo - VBo = 0. Because J, is parallel to Bo, Jı must 
also be straight and so a Coulomb gauge vector potential will be parallel to Jı, since 


loli = VXVx Ay 
= VV Ag V°A 
= =V7Ay (12.66) 
= =(V7Ay (12.67) 


can be satisfied by having both J, and A, in the direction of ¢ where ¢ = Bo(0)/Bo(0) 
does not depend on position. 
It is therefore assumed that A, is parallel to Bo(0) and so 


Ai(z,y, 2, t) = Are(x)e kuy tikzztyt, (12.68) 


Thus, k - A, = 0 in this tilted coordinate system and also V - A; = 0 so that the Coulomb 
gauge assumption is satisfied. 

Reconsideration of the right hand side of Eq. (12.63) shows that although the two terms 
both vanish at x = 0 there is a difference in the geometrical dependence of these terms. In 
particular 

Jı- VBo=0 
is true for all x because J, is in the ¢ direction and Bo is a function of x only. Thus, Eq. 
(12.63) reduces to 


ON, 


ot 


Bo: VJ 
i(k- Bo) Jy (12.69) 


where, by assumption, k - Bo vanishes at x = 0. 
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Continuing this discussion of the ramifications of the antisymmetry of k - Bo about 
x = 0, we now define an artificial reference magnetic field B that is parallel to the real 
field at x = O, but has no shear (i.e. has no x dependence). The reference field therefore 
has the form 
B =B,0(0)9 + B202 = Bo(0)¢ for all values of z. (12.70) 
We now define bg as the difference between the real field and B so that 


Bo(z) = B + bo(z) (12.71) 


and bo (a) has the same x-dependence as the Byo field used for the sheet current instability 
in the previous section in that (i) it is antisymmetric about x and (ii) is in the y direction. 
One way of thinking about this is to realize that bo (2) is the component of Bo (a) which is 
antisymmetric about x = 0. With this definition 


k- Bo =k- bo(x) = kybo (12.72) 


and so Eq.(12.69) becomes 

yeQic = ikybo Jic. (12.73) 
This equation provides the essence of the dynamics. It shows how the component of the 
magnetic field that is antisymmetric about x = 0 creates fluid vortices that are antisymmet- 
ric with respect to x. In particular, since bo(x) is an odd function of x and Jı¢ is an even 
function of x, Qy¢ is an odd function of x. 

The complete self-consistent description is obtained by in addition taking into account 
the induction equation which shows how fluid motion acts to create perturbations of the 
electromagnetic field. 

Since we are working with vector potentials, Ohm’s law provides the induction equa- 
tion. In the sheet current case the component of Ohm’s law in the direction of symmetry 
of the perturbation was considered, i.e., the component in the z direction. Here, the cor- 
responding symmetry direction for the perturbation is the ¢ direction and so the relevant 
component of Ohm’s law is the ¢ component, 


¢- [E1 + Ui x Bo(x)] = ¢-751 (12.74) 
which becomes aA 
-2 + (g x C- Ui) bola) = nic. (12.75) 


Since the vorticity vector lies along ¢ the incompressible velocity must be orthogonal to ¢ 
and so has the form 
U =Vfi x ¢. (12.76) 
Thus 5 f : 
Gxt: UL= (axt) (VA x?) = iky fı. (12.77) 
Again, taking into account the fact that the vorticity points in the ¢ direction it is seen that 
Qı = eg where 
Qe=- V xU =-V? fi (12.78) 
and now | means perpendicular to a 
Substituting for Qi¢ in Eq.(12.73) using Eq. (12.78) gives 


B ikybo(x) Jic f 


Vine 
va. YPo 


(12.79) 
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this equation provides the essential dynamics of fluid vortices that are antisymmetric with 
respect to x and driven by the torque associated with the non-conservative nature, i.e., curl, 
of the J x B force. This is essentially the same as Eq.(12.35) and the rest of the analysis is 
the same as for the sheet current problem except that now ky is used instead of k and bg is 
used instead of Bg,,. Since bo(x) only differs from Byo(x) by a constant, bg = Bio. Thus, 
using Eq.(12.54) the growth rate will be 


re 
(ky Blo)” ] 


Polo 


3/5 
y = 0.55 (A) | 4 | (12.80) 


Ho 


The global system has to be periodic in both the y and z direction in order for well-defined 
ky and k, to exist. In particular, the physical arrangement and dimensions of the global 
system determine the quantized spectra of k, and k, and so determine the allowed planes 
where k -Bo can vanish. As suggested earlier, the allowed planes can be considered 
as ‘cleavage’ planes where the magnetic field can most easily become unglued from the 
plasma. 

Let us express this result in the context of toroidal geometry such as that of a tokamak. 
This is done by letting 6,9 correspond to the toroidal field By and By correspond to Bg 
the poloidal field. The Alfvén time is now defined in terms of By as 


-1 Bo 
Ta = 
4,/PoHo 


where a is the minor radius. The safety factor, a measure of the twist, is defined as 


(12.81) 


aBy 
= 12.82 
I= RB, ( ) 
so that 3 
aDo 
Bo = E 12.83 
a Rg ( ) 
and as 
ah¢ 
By zo (12.84) 
Thus it is possible to replace ky — m /a and 
(kyBio) fma \ 1 
L ( > ‘) : (12.85) 
Polo Rq TA 
At the tearing layer k - B = 0 or 
m n 
— Bọ + — B4 = 0 12.86 
z Pe + Ree ( ) 
so oa 
q=-— (12.87) 
n 
and Eq.(12.85) becomes 


ky Bio)” Ne 
(KyByo) ($2) 3 (12.88) 
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Thus, Eq.(12.80) becomes 


2 N 2/5 
y =0.55 (A'a) rR r ee (12.89) 
where n 
a 
ea (12.90) 


In the form given by Eq.(12.89), it is seen that the essential source for the tearing instability 
is q'. From this point of view, the ‘free energy’ is in the gradient of q and so as the tearing 
mode uses up this free energy, the q profile will be flattened. 


y 


a 7 


Figure 12.5: Surfaces of constant A, showing magnetic islands and separatrix with width 
w. 


12.6 Magnetic islands 


The tearing instability changes the topology of the magnetic field and causes the formation 
of magnetic ‘islands’ (Bateman 1978). The equilibrium magnetic field has the form 
B = B+B, (x)ý = B+2Blg = B+VA,0(2) x 2 (12.91) 


where 
xB! 
z` 


A(x) = (12.92) 
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It is seen that 
Bi -VAa =0 (12.93) 


where B | is the component perpendicular to z. Thus, the surfaces A, = const. give the 
projection of the field lines in the plane perpendicular to z; these projections correspond to 
the poloidal flux surfaces in toroidal geometry. 

If it is assumed that k, << ky, so that ¢ is very nearly parallel to z, then it is seen that 
the tearing instability adds a perturbation to A, so that now 


2B! 
A,(2,y) = 2 5 + Asi 008 ky. (12.94) 


A sketch of a set of surfaces of constant A, (x, y) is shown in Fig.12.5. These surfaces 
consist of (i) closed curves called islands, (ii) a separatrix which passes through the x point, 
and (iii) open outer surfaces. The width w of the separatrix can be calculated, by noting 
that at the x—point 


Asep =0+ An (12.95) 


while at the point of maximum width, x = w/2, 


2)” B! 
ot Ms As (12.96) 


2 
; Azı 
T ay B! 


Equating these gives 


But 
kyB! 


(12.97) 


~ 


using B,, = ik,A,, In tokamak terminology we identify B,,; — B,, where r is the minor 
radius, and using Eq.(12.84), (12.87) and ky — m/a this becomes 


Rq Bry 
nq’ By 


. (12.98) 


It is important to realize that the width of the island is much larger than the width of the 
tearing layer. Since particles tend to be attached to magnetic flux surfaces, the formation 
of islands means that particles can circulate around the island, thereby causing a flattening 
of the pressure gradient because the pressure is constant along a magnetic field line. 
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12.7 Assignments 
(b) 
current : 
loop poloidal X-point X-point 
flux surface 
Vout Ogi 


current sheet 


Figure 12.6: (a) Two identical coaxial current loops; (b) current loops approaching each 
other; (c) detail of X-point region of (b) showing inflows/outflows and current-sheet 


(shaded region) having nominal length L and thickness ô. 


1. Sweet-Parker type reconnection(Sweet 1958, Parker 1957): Consider the two identical 
flux-conserving current loops shown in Fig. 12.6(a). Because the system is axisym- 


metric, the magnetic field can be expressed as 


1 
B =— Vy x V¢. 
20 


(a) Explain why there is an attractive force between the current loops (hint: consider 
the force between parallel currents or between north and south poles of two 


magnets). 


(b) 


(c) 


(d) 


(e 


wm 
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Define private flux to be a flux surface that links only one of the current loops 
(examples are the flux surfaces labeled 1 and 2 in Fig.12.6(a)). Define public 
flux to be a flux surface that links both of the current loops (examples are flux 
surfaces 3,4,5 in Fig. 12.6(a)). Define the X-point to be the location in the z = 0 
plane where there is a field null as shown in Fig.12.6(a); let ro be the radius 
of the X-point. Show by sketching that as the two current loops approach each 
other in vacuum, a private flux surface above the midplane will merge with a 
private flux surface below to form a public flux surface. 


Show that the flux linked by a circle in the z = O plane with radius ro (i.e., 
the circle follows the locus of the X-point) is the public flux. Argue that if the 
current loops approach each other, this public flux will increase at the rate that 
private flux is converted into public flux. By integrating Faraday’s law over 
the surface of this circle in the z plane with radius ro, show that there exists a 
toroidal electric field at the X-point, 


O 1 pubiic 
2779 ot 


where OY pubiic/ Ot is the rate of increase of public flux. 


Eg = 


Now suppose that the vacuum is replaced by perfectly conducting plasma so that 
plasma is frozen to field lines. Show from symmetry that the plasma velocity at 
the x point must vanish. What is Fg at the X-point if the two current loops are 
immersed in a plasma which satisfies the ideal Ohm’s law, 


E+UxB=0. 


Can there be any conversion of private flux into public flux in an ideal plasma? 
Sketch contours of private and public flux when two current loops approach each 
other in an ideal plasma. 


Now suppose the two current loops are immersed in a non-ideal plasma which 
satisfies the resistive Ohm’s law 
E +U x B=yJ. 


Can private flux be converted into public flux in this situation, and if so, what is 
the relationship between the rate of increase of public flux and the resistivity? 


By writing Eg, Br, Bz, and Jọ in terms of the poloidal flux function show that 
the toroidal component of the resistive Ohm’s law can be expressed as 


Ow Heh nes n 3 
At 4U-Vy=r-V (ive) p 
Show that this equation implies that flux convects with the plasma if 7 = 0, but 
diffuses across the plasma if U = 0. Show that the diffusion increases if 7 devel- 
ops a steep gradient at the location where U = 0. By considering your response 
to (d) above, discuss how the flux gradient in the z direction might steepen in the 
vicinity of the X-point. Taking into account the relationship between J and 4, 
show that this corresponds to developing a thin sheet of current in the midplane 
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(current-sheet) as shown in Fig.12.6(c). Show that in order to have an X-point 
geometry and also satisfy Faraday’s law, the flux function must have the form 


z (r = ro)? 
28°? 2L? 


V(r, z,t) = (1 ) Po — 2rr Egt 

in the vicinity of the X-point, here Yọ is the flux at the X-point at t = O and ro 
is the radial location of the X-point. What is the relationship between 6 and L in 
vacuum and ina resistive plasma? Is 6 larger or smaller than L if the two current 
loops are approaching each other? If ô < L is there a current in the vicinity of 
the X-point? What is the direction of this current with respect to the currents in 
the two current loops? Will the force on the current loops from the current in 
the current sheet accelerate or retard the motion of the two current loops towards 
each other? What is the direction of Hg? Is ~ increasing or decreasing at the 
X-point? 


Suppose that the current sheet has a nominal thickness of 6 in the z direction 
and a nominal width of L in the r direction as shown in Fig.12.6(c). Let Uin 
be the nominal vertical velocity with which the plasma approaches the x point 
and let Usut be the nominal horizontal velocity with which the plasma leaves the 
x point region as shown in Fig.12.6(c). If the plasma motion is incompressible 
show that 

UinL ~ Uoutd. (12.99) 


Consider the transition from the ideal MHD form of Ohm’s law to the form at 
the X-point. Argue that in the region of this transition the terms U x B and nJ 
should have the same order of magnitude (this is essentially the same argument 
that was used to analyze Eq.(12.49)). Use this result and Ampere’s law to find a 
relationship between nJ, Bin and 6. Use this to show 


Um ~N [bod (12.100) 


and explain this result in terms of the convective velocity for motion of flux 
surfaces outside the current sheet and the "diffusive velocity" for motion of flux 
surfaces inside the current sheet. 


Using Eq.(9.42) show that the MHD force acting on the plasma is 


Vu 1 
Fyap= © V. ( vv) 
(277)” Ho 


Sketch the direction of this force and indicate where this force is finite (hint: 
consider where J is finite). Estimate the work f F-dl done on plasma accelerated 
through the current sheet by using relationships involving Bin, Bout, 6, and L. 
Use this estimate to show that 


Uout ® VA in (12.101) 


where U4,in = Bin/./Hopis the Alfvén velocity at the input. (Hint: Let Ay be 
the jump in flux experienced by a fluid element as it moves across the current 


Q) 


(k 


a 
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sheet. Note that Bout ~ Aw/2mrL and Bin ~ Ay/2rrô; also note that V - 
(r=?Vy) can be expressed in terms of Bin and ô.) 


By eliminating U;,, and Usut between Eqs.(12.99), (12.100), and (12.101) show 
that 
ô Un _— 1 


L Va,in V Sin 


where the inflow region Lundquist number S'in is defined as 


Fa = LgVa,inL 
n 


If the initial separation between the two current loops is also of the order of 
L, how much time is required for the two loops to merge, thus completing the 
reconnection process? Structures in the solar corona have scale lengths L ~ 
1-100 megameters (108 m) and it is observed that Alfvén wave disturbances 
propagate with velocities v4 ~ 10° m/s. The temperature of these structures is 
10-100 eV. Assuming a nominal temperature of 50 eV, use Spitzer resistivity to 
calculate the nominal value of Sin. How long would it take for two solar current 
loops to merge if they are governed by the mechanism discussed here (what is 
the appropriate time unit to use seconds, days, weeks, years, ...?). Actual current 
loops change topology on time scales ranging from minutes to hours — how does 
this compare with the predictions of this model? 
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Fokker-Planck theory of collisions 


13.1 Introduction 


Logically, this chapter ought to be located at the beginning of Chapter 2, just after phase- 
space concepts had been introduced. The reason this chapter is not located there is that 
the theory in this chapter is too involved to be at the beginning of the book and would have 
delayed the introduction of important other topics that do not need the detail of this chapter. 

The discussion of collisions in Chapter 1 was very approximate. Collisions were shown 
to scale as an inverse power of temperature, but this was based on a “one size fits all” 
analysis since it was assumed that the collision frequency of both slow and fast particles 
were nominally the same as that of a particle going at the thermal velocity. Because the 
collision frequency scales as v73, it is dubious to assume that the collision rates of both 
super-thermal and subthermal particles can be reasonably represented by a single collision 
frequency. A more careful averaging over velocities is clearly warranted. This is provided 
by a Fokker-Planck analysis due to Rosenbluth, Macdonald and Judd (1957) but if all 
this analysis provided was more accuracy for collision calculations, it would not be worth 
the considerable effort except possibly for exceptional situations where high accuracy is 
important. However, it turns out that more than increased accuracy results from this theory, 
because some new and important phenomena become evident from the analysis. 

We begin the analysis by reviewing collisions between two particles. Suppose a test 
particle having mass my and charge qr collides with a field particle having mass mp and 
charge qr. Since the electric field associated with a charge q is E = fq/4reor? where r 
is the distance from the charge, the equations of motion for the respective test and field 
particles are 


z qT IF 
13.1 
wee lero re Saas 
mpip TAR Gee, (13.1b) 


4reolrr — rel? 


The center of mass vector is defined to be 


= MTT + MPP (13 2) 


MT TMF 
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Adding Eqs.(13.1a) and (13.1b) gives 
(mp +mp)R = mrër + mrpr =0 (13.3) 


showing that the center of mass velocity R is a constant of the motion and so does not 
change as a result of the collision. If Atr and Arp are defined as the change in respective 
velocities of the two particles during a collision, then it is seen that these changes are not 
independent, but are related by 


(mp +mp)AR = 0 = mrAtr + mrâtr. (13.4) 
It is useful to define the relative position vector 
r=rr—rfpF (13.5) 
and the reduced mass 
1 1 1 
Sann: (13.6) 


H T mT MBE 
Then, dividing Eqs.(13.1a) and (13.1b) by their respective masses and taking the difference 
between the resulting equations gives an equation of motion for the relative velocity 


„n TIF . 
= ; 13.7 
a Ategr? Get) 
Solving Eqs.(13.2) and (13.5) for ry and rp gives 
rr = R-&r (13.8) 
Mp 
rrp = R+ shee, 
MT 
and so the respective test and field velocities are 
ip El ees (13.9) 
MF 
ir = R+ Et. (13.10) 
MT 


Since AR = 0 the change of the test and field particle velocities as measured in the lab 
frame can be related to the change in the relative velocity by 


Atp = -At (13.11) 
mp 

Air = +E At. (13.12) 
MT 


The collision problem is first solved in the center of mass frame to find the change in 
the relative velocity and then the center of mass result is transformed to the lab-frame to 
determine the change in the lab-frame velocity of the particles. 

Let us now consider a many-particle point of view. Suppose a mono-energetic beam of 
particles impinges upon a background plasma. Several effects are expected to occur due 
to collisions between the beam and the background plasma. First, there will be a slowing 
down of the beam as it loses momentum due to collisions. Second, there should be a 
spreading out of the velocity distribution of the particles in the beam since collisions will 
tend to diffuse the velocity of the particles in the beam. Meantime, the background plasma 
should be heated and also should gain momentum due to the collisions. Eventually, the 
beam should be so slowed down and so spread out that it becomes indistinguishable from 
the background plasma which will be warmer because of the energy transferred from the 
beam. 
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13.2 Statistical argument for the development of the 
Fokker-Planck equation 


The Fokker-Planck theory (Rosenbluth et al. 1957) is based on a logical argument on how a 
distribution function attains its present form due to collisions at some earlier time. Consider 
a particle with velocity v at time t that is subject to random collisions which change its 
velocity. We define F'(v, Av) as the conditional probability that if a particle has a velocity 
v at time t, then at some later time t + At, collisions will have caused the particle to have 
velocity v + Av. 

Clearly at time t + At the particle must have some velocity, so the sum of all the 
conditional probabilities must be unity, i.e., F must be normalized such that 


J Fo. Avyaay =1. (13.13) 


This definition of conditional probability can be used to show how a present distribution 
function f(v,t) comes to be the way it is because of the way it was at time t — At. This 
can be expressed as 


f(v,t) = [to-avt — At) F(v—Av, Av)dAv (13.14) 


which sums up all the possible ways for obtaining a given present velocity weighted by 
the probability of each of these ways occurring. This analysis presumes that the present 
status depends only on what happened during the previous collision and is independent 
of all events before that previous collision. A partial differential equation for f can be 
constructed by Taylor expanding the integrand as follows: 


f(v—Av,t— At)F(w—Av, Av) = f(v,t)F(v, Av)-AtL rw, Av) 
ð 
Av (f(v,t)F(v, Av)) 
1 ð ə 
+5 AvAv ay Ov. (f(v,t)F(v, Av)). 
(13.15) 
Substitution of Eq.(13.15) into Eq.(13.14) gives 


f(v,t) = flv, t) f F(v, Avjadv— art f Flv, Ava 
- fave (f(v,t)F(v, Av)) dAv 
+5 fJ AvAv ae (f(v,t)F(v, Av)) dAv 
(13.16) 


where in the top line advantage has been taken of f(v,t) not depending on Av. Upon 
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invoking Eq.(13.13) this can be recast as 


ð (fs AvF(v, Av)dAv 


oP ON At n (13.17) 
ð | 1ə ð, LAVAVEV AVAN í ' 
' 2 ðv ðv At ao 
By defining 
Av \ _ [J AvF(v, Av)dAv 
(= )- N (13.18) 
and 
AvAv\ _ J AvAvF(v, Av)dAv 
= 13.1 
(em, ` (219) 


the standard form of the Fokker-Planck equation is obtained, 


of ð Av , POs ð AvAv 
ot Ov (( ar) ftv.t)) "2 av Ov ` (( At GADE (1929) 


The first term gives the slowing down of a beam and is called the frictional term while the 
second term gives the spreading out of a beam and is called the diffusive term. 

The goal now is to compute (Av/At) and (AvAv/At). To do this, it is necessary 
to consider all possible ways there can be a change in velocity Av in time At and then 
average all values of Av and AvAv weighted according to their respective probability of 
occurrence. We will first evaluate (Av /At) and then later use the same method to evaluate 
(AvAv/At). The problem will first be solved in the center of mass frame and so involves 
a particle with reduced mass ju, speed vre, = |vr — vel, and impact parameter b colliding 
with a stationary scattering center at the origin. This was discussed in Chapter 1 and the 
geometry was sketched in Fig.1.3. The deflection angle 0 for small angle scattering was 
shown to be 


Ges: qT YF 
ae at 
2rEegbuvz. 
The averaging procedure is done in two stages: 

1. The effect of collisions in time At is calculated. The functional dependence of Av on 
the scattering angle 0 is determined and then used to calculate the weighted average 
Av for all possible impact parameters and all possible azimuthal angles of incidence. 
This result is then used to calculate the weighted average change in target particle 


velocity in the lab frame. 


(13.21) 


2. An averaging is then performed over all possible field particle velocities weighted 
according to their probability, i.e., weighted according to the field particle distribution 
function fp (vp). 


Energy is conserved in the center of mass frame and since the energy before and after 
the collision is entirely composed of kinetic energy, the magnitude of v., must be the same 
before and after the collision. The collision thus has the effect of causing a simple rotation 
of the relative velocity vector by the angle 0. Let z be the direction of the relative velocity 


before the collision and let ô be the direction of the impact parameter. If V;-ci1, Vre12 are 
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the respective relative velocities before and after the collision, then 


Vrell = UrelZ 
Vrel2 = Ure COS OZ + Ure; sin Ob. (13.22) 
Thus 
AVrel = Vrel2 — Vrell 


= Ure (cos — 1) Ê + Vpe1 sin Ob 


2 
~ -2A 2 + oO coso + Hsin 6) (13.23) 
where œ is the angle between the impact parameter and the x axis and the scattering angle 
0 is assumed to be small. The fact that the z component is negative reflects the slowing 
down of the particle in its initial direction of motion. 

The cross section associated with impact parameter b and the range of azimuthal impact 
angle dọ is bdddb. In time At the incident particle moves a distance v At and so the vol- 
ume swept out for this cross-section is bddédbv At. The number of relevant field particles 
encountered in time At will be the density fr(vp)dvp of relevant field particles multi- 
plied by this volume, i.e., fr(vr)dvpbdddbvAt and so the change in relative velocity for 
all possible impact parameters and all possible azimuthal angles for a given vp will be 


AVjant b, all 6 = vAtfpr (vp)dve fo Avobdb. (13.24) 


The limits of integration for the azimuthal angle are from 0 to 27 and the limits of integra- 
tion of the impact parameter are from the 90° impact parameter to the Debye length. On 
writing Av in component form, this becomes 


27 AD 
AVreilall b, alọ = UreiAtfr(vr)dvr y aw f Urei (0 coso, O sin o, —6° /2) bdb. 
0 bets 


(13.25) 
The x and y integrals vanish upon integration over ġ and so 
AD 
AVreljall bald = — 2v2 At fp (ve)aven | 0?bdb 
br/2 
AD 2 
2 —ivi,Atfe(vrjaver | (=) bdb 
bx /2 2TE0bUV el 
Ip 
= —ZAtfr (vp )dve mez 2v2_, In A (13.26) 


where A = Ap/br/2- 
Using Eq.(13.12) the above expression can be transformed to give the test particle ve- 
locity change to be 


ITIR OOA 


7 2 2 
ATESUMTUV el 


Avrjait,aig = —2Atfr(vr)dve 


qdq? ln A (vr — ve) 
Anezumrp |vr — vel? 


fr(vr)dvp. (13.27) 
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This result was for a particular value of vp and summing over all possible vp weighted 
according to their probability, gives 


22 
qiq ln A f Vrei 
A , = —At dvr. 13.28 
Vall b, all ġ, all vr ee | fr(vr)dve ( ) 


The integrand can be expressed in a simpler form by considering that 


Or o o 0] r 
= | ĉî— +9 +2 yat ky en 13.2 
or Ge Voy a2) PEE r ee) 


and also 0/Ovr = 0/Ov,i. Thus 


o re re 
tel Neel (13.30) 
ðvr Urel 
and 
ð 1 1 Vrel 
= : 13.31 
OVT Urel es Urel ( ) 
Equation (13.28) can therefore be written as 
25.9 
qqr nA ð 1 
Aviallb. alld, allve = At dvr. 13.32 
Vall b, all ¢, all ve i o E ae fr(vr)dvp ( ) 


The 0/Ovr has been factored out of the integral because vr is independent of vp, the 
variable of integration. Since vp is a dummy variable, it can be replaced by v’ and since 
vr is the velocity of the given incident particle, the subscript T can be dropped. After 
making these re-arrangements we obtain the desired result, 


(=) 5 dq; mA O J; fr(v') yi (13.33) 
| | | 


At/  4režumr Ov J |v —v'| 


The averaging procedure for Av Av is performed in a similar manner, starting by noting 
that 


AvAv = (8 cos $, sind, —6° /2) (0 cos ¢, Osin o, —0° /2) (13.34) 
and so Eq.(13.25) is replaced by 


2m AD 
AVrei AVreijallb, alld = vreiAtfpl(ve)dve | a f bdb x 
0 ba /2 


v2 (0 cos @, Osin p, —0°/2) (0 cos @, Osin p, —0°/2) ; 
(13.35) 


The terms linear in sin ¢ or cos ¢ vanish upon performing the ¢ integration. Also, the 22 
term scales as 0* and so may be neglected relative to the ĉĉ and yy terms which scale as 
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67. Thus, we obtain 


2m AD 
AvretAVretlaib,g = Veet fr(vr)dvr ii do / 0° (ĉĉ cos” o + yy sin” Ø) bdb 
0 ba /2 


AD 
= v>,,Atfr(vr)dvp n(ea +09) | 07 bdb 


bx /2 
2 2 
o py drip mA 
= At d | ; 13.36 
fr(vr)dvr (ĉĉ + yy) dee iea ( ) 
Since v,e; defines the z direction, we may write 
ê + 9g = Eie (13.37) 
Urel 
where I is the unit tensor. Thus, 
2 2 2 
ER IT dE In A Urel — VrelVrel 
AV ret AVreiall b, all ¢ = eg re fr(vr)dvf (eet - (13.38) 
Using Eq.(13.12) this can be transformed to give 
2.9 2 
qT Up InA Vel — VrelVrel 
AvrAv jab. alo = At = 5 d to?! 13.39 
VTÂVT]all b, all ġ E fr(vr)dvp ( E ( ) 
The tensor may be expressed in a simpler form by noting 
o Wrel AS 0 Vrel 
Ove ðvr OVT Vrel 
x I Vrel Vrel 
g Urel va Urel 
2 
I- relVYre 
Se ete ae (13.40) 


Vrel 
Inserting Eq.(13.40) in Eq.(13.39) and then integrating over all the field particles shows 
that 


( AvrAvr ) _ GapinA 

At Arepm?, 
However, vr is constant in the integrand and so 0/Ovr may be factored from the integral. 
Dropping the subscript T from the velocity and changing vpr to be the integration variable 
v’ this can be re-written as 


ð ð 
J orive glr — ve (13.41) 


AvAv gqmA ð ð 
( K \= dent ae = fw v' |fr(v')dv'. (13.42) 
It is convenient to define the Rosenbluth potentials 
gr(v) = [iv -¥'lfe(wav' (13.43) 
1 
hp(v) = a av (13.44) 
u v-v 
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in which case 


AVN gq? n A hp 
At ~ Anem? Ov 
AvA 2q mA 3? 
u ae L (13.45) 
At dmegm> OvOv 
The Fokker-Planck equation, Eq.(13.20) thus becomes 
Ofr Gq¢pinAf[ 2 hp 183 ð gp 
Tar : : . (13.46 
a £. Anemi | Ov fr ðv 2 ðv ðv fr OvOv ( ) 


The first term on the right hand side is a friction term which slows down the mean velocity 
associated with fpr while the second term is an isotropization term which spreads out (i.e., 
diffuses) the velocity distribution described by fr. The right hand side of Eq.(13.46) thus 
gives the rate of change of the distribution function due to collisions and so is the correct 
quantity to put on the right hand side of Eq.(2.12). 


13.2.1 Slowing down 


Mean velocity is defined by 
| vfdv 


u= 


(13.47) 


n 
where n = f fdv. The rate of change of the mean velocity of species T is thus found by 
taking the first velocity moment of Eq.(13.46). Integration by parts on the right hand side 


terms respectively give 
Oh Ooh 
Is (hr ae ) aw =| r dv (13.48) 


a ð o a? oO 
GF GF 
: dv = dv =0 13.4 
Te ðv (4 ro) M T ðv (r35 E) M E 
where Gauss’s theorem has been used in Eq.(13.49). The frst ce moment of Eq.(13.46) 
is therefore ‘ A ‘4 
ee q2? pln hp 
= dv. 13.50 
p? 4regnr ma T fr av V ( ) 
Let us now suppose that the test narices consist of a mono-energetic beam so that 
fr(v) = nrô(v — uo). (13.51) 
In this case Eq.(13.50) becomes 
ður q2 mA /ôhp 
—— = ; 13.52 
Ot a Anezm2 Əv Jou Ue) 


Let us further suppose that the field particles have a Maxwellian distribution so that 


3/2 
fr(v) =np (sas =) exp (—mpv?/2KTp) (13.53) 
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and so, using Eq.(13.44), 


hp(v) = dv’. (13.54) 


me MF J SeT) 


u 2TKTp lv —v'| 


The velocity integral in Eq.(13.54) can be evaluated using standard means (see assign- 


ments) to obtain 
NEMT MF 
h = —— erf |,/ 13. 
F(v) T er ( KT p v) (13.55) 


erf (x) = = L exp(—w”)dw (13.56) 


where 


is the Error Function. 
Thus, Eq.(13.52) becomes 


Our niqqa? nAmr fo] i Mi 
= ae ENT v` erf —v 
ot A4nregmi p;i | Ov 2KT; ae 
2 2 
NeGrGe nAmr fə | _, Me 
f 13.57 
Aree@m2, He 2 ane: KT.” = vee!) 
v=uo 


-1 -1 -1 
where H; e = Mie +Mp . 


This can be further simplified by noting (i) quasi-neutrality implies 


niZqi + NeGe = 0 (13.58) 


where Z is the charge of the ions, (ii) the masses are related by 


Pee, (13.59) 


and (iii) the velocity gradient of the Error Function must be in the direction of ug because 
that is the only direction there is in the problem. Using these relationships and realizing 
that both the left and right sides are in the direction of u, Eq.(13.57) becomes 


~~ 2 


Oise" 
er u 
ðu nee? nA Qe Z 14.2 d 2KT; 
ot Areg mi. 


(13.60) 


Mie 
Sp pees 13.61 
a On Fi (13.61) 


Let us define 
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which is the ratio of the beam velocity to the thermal velocity of the ions or electrons. The 
slowing down relationship can then be expressed as 


ðu nee? mA q3 lz (1 ar | mi d (=) 


Ot Ane? mê mi ) 2KT; d&; E 
MT\ Me d [f(erf(£e) 
t (14 ‘ 13.62 
Foda se 
: fx x 
lima (r) = F 


= (=> ) oat 
xX 


0.377 
0.27 


(A 


Figure 13.1: Plot of —(x~! erf £)’ v. x. 


Figure 13.1 plots — (erf(x)/x)' as a function of z; this quantity has a maximum when 
x = 0.9 indicating that the friction increases with respect to x when z is less than 0.9 but 
decreases with respect to x when x exceeds this value. Examination of this figure suggests 
that the function has a linear dependence for x well to the left of the maximum and varies 
as some inverse power of x well to the right of the maximum. 

This conjecture is verified by noting that Eq.(13.56) has the limiting values 


li f(z) ~ =. fa- ?2\d et BEA 
ean x = Vi w "= Tk x 3 
1 


im, erf(z) ~ (13.63) 
and so 
d erfx 4x 
im ( -2 See 13.64 
neel ( dz x ) 3/0 ter) 
: d erfx 1 
eee vs 


which is consistent with the figure. 

The existence of these two different asymptotic limits according to the ratio of the test 
particle velocity (i.e., beam velocity) to the thermal velocity indicates three distinct regimes 
can occur, namely: 
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1. Test particle is much faster than both ion and electron thermal velocities, that is 
EE. >>. 


2. Test particle is much slower than both ion and electron thermal velocities, that is 
Eise << 1. 


3. Test particle is much faster than the ion thermal velocity, but much slower than the 


electron thermal velocity, i.e., £; >> 1l and é << 1. 


A nominal slowing down time can be defined by writing the generic slowing down 
equation 


ðu u 
see ay a 13. 
ae (369 
so 
u 
= —-—_ 13. 
Ts 5u/at (13.66) 


this can be used to compare slowing down of test particles in the three situations listed 
above. 


Test particle faster than both electrons, ions 
Here the limit given by Eq.(13.64b) is used for both electrons and ions so that the 
slowing down becomes 


pe EP | mes 
Ou nee? In A gi. z(14 m) l (14 m) 


= ; 13.67 
Ot Ane? = mi, u? ( ) 
and so 
Ane? m3 u? 
sa 13.68 
= memi & 7414 iat 


Me 

since 1/Mme >> 1/m;. Equation (13.67) shows that if the test particle beam is composed 
of electrons, the ion friction scales as Z/(Z + 2) of the total friction while the electron 
friction scales as 2/(2 + Z) of the total friction. In contrast, if the test particle is an ion, 
the friction is almost entirely from collisions with electrons. The slowing down time is 
insensitive to the plasma temperature except for the weak In A dependence. The slowing 
down time for ions is of the order of m; /Me times longer than the slowing down time for 
electrons having the same velocity. 

Test particle slower than both electrons, ions 

Here the limit given by Eq.(13.64a) is used for both electrons and ions so that the 
slowing down equation becomes 


Ou u nee? nA qi- g(14 mr Mi ate otis mr Me 3/2 
Ot 3T Te m m; 2KT; l Me Q2KT. 
(13.69) 
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and the slowing down time becomes 


_ 3 fane§ mh 1 


~ nee? In A gÈ alia ™ m; ae mr MEN EN 
i mj; 2KT; i l Me 2KTo 


(13.70) 

The temperature terms scale as the inverse cube of the thermal velocity and so if the 

ion and electron temperatures are the same order of magnitude, then the ion contribution 

dominates. Thus, the slowing down is mainly done by collisions with ions and the slowing 
down time is 


_ 8v ame Mp (2KT; / m)? 
*~ nee? nA q2 zZ prt - 
Mi 
The beam takes a longer time to slow down in hotter plasmas. 
Intermediate case: beam faster than ions, slower than electrons 
In this case, the slowing down equation becomes 


a .e2InA g2 1 aN IE 
See a ie A (ya ig ee | 03.72 
Ot Areg må Mi J u? Me 2KTe 3T 


and the slowing down time is 


MT 4 MT m 3/2) 
egpe2nA| Z(14 1; 14 g 
mapama (z (1E) b+ az (iE Ga) | 


13.3 Electrical resistivity 


(13.71) 


Ts = 


If a uniform steady electric field is imposed on a plasma this electric field will accelerate 
the ions and electrons in opposite directions. The accelerated particles will collide with 
other particles and this frictional drag will oppose the acceleration so that a steady state 
might be achieved where the accelerating force balances the drag force. The electric field 
causes equal and opposite momentum gains by the electrons and ions and therefore does not 
change the net momentum of the entire plasma. Furthermore, electron-electron collisions 
cannot change the total momentum of all the electrons and ion-ion collisions cannot change 
the momentum of all the ions. The only wave for the entirety of electrons to lose momentum 
is by collisions with ions. 

Let us assume that an equilibrium is attained where the electrons and ions have shifted 
Maxwellian distribution functions 


nj Bye. 
fl) ml? (Qn; /m;)?/? = ( Malm] /2KT:) 
Ne 2 M 
fe(v) ea Foe exp ( me (v — ue) /26T.) (13.74) 
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The current density will be 

J =n qu; + NeGeUe- (13.75) 
It is convenient to transform to a frame moving with the electrons, i.e., with the velocity 
u,. In this frame the electron mean velocity will be zero and the ion mean velocity will be 


Ure} = U; — Ue. The current density remains the same because it is proportional to upe; 
which is frame-independent. The distribution functions in this frame will be 


= Ni l 2 a, 
flv) = 13/2 (2KT; /m,)3/2 a ( mi (V — Urei) /2KT;) 
Ne 2 Jop 
fe(v) OAE exp (=mev? /2KTe) . (13.76) 


Since the ion thermal velocity is much smaller than the electron thermal velocity, the 
ion distribution function is much narrower than the electron distribution function and the 
ions can be considered as a mono-energetic beam impinging upon the electrons. Using 
Eq.(13.57), the net force on this ion beam due to the combination of frictional drag on 
electrons and acceleration due to the electric field is 


ðure nede mnAf af i Me 
; — Meh Ge f iE. 13.77 
miat Anezu, \ 3v ie S IRT, mae a (132) 


In steady-state the two terms on the right hand side balance each other in which case 


2 
neqie nA faf _, Me 
E= 7 f > 13.7 
ee 2 > er IKT” EP (13.78) 


If uye, is much smaller than the electron thermal velocity, then Eq.(13.64a) can be used to 
obtain 


neqie? nA Urel Zem! mA J 


E= =i 
3/TTEGMe (IKT /me)*!? BYmme§ (2K T.)°/? 


(13.79) 


where upe; = J /nee and uz! = 1/m; +1/Me ~ 1/Me have been used. Equation (13.79) 
can be used to define the electrical resistivity 


_ Zem! ln A 
8/779 (26T)? 


n (13.80) 


so that 
E =J. (13.81) 


If the temperature is expressed in terms of electron volts then « = e and the resistivity is 


ZA 
n = 1.03 x 107442 
TP 


e 


Ohm-m. (13.82) 
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13.4 Runaway electric field 


The evaluation of the error function in Eq.(13.78) involved the assumption that ure, the 
relative drift between the electron and ion mean velocities, is much smaller than the electron 
thermal velocity. This assumption implies that — (x~! erf x)! ~ 4a/3,/7, so that < lies 
well to the left of 0.9 in the curve plotted in Fig.13.1. Consideration of Fig.13.1 shows that 
— (x~ t erf z)' has a maximum value of 0.43 which occurs when x = 0.9 so that Eq.(13.78) 
cannot be satisfied if (Drecier 1959) 


2 3 
Nneqie MAO | _, Me neZe’ n A 
E -— f = 0.43——. 13.83 
a max f Anezme Ov > = OKT. 8rezKTe ( ) 


If the electron temperature is expressed in terms of electron volts, attaining a balance be- 
tween the acceleration due to the electric field and frictional drag due becomes impossible 
if 


E> EDreicer (13.84) 
where the Dreicer electric field is defined by 
eZee mA 
EDreicer = gdy a ma 


8rezKTe 


5.6 x 10- 18n, Z 24 
T, 


e 


V/m. (13.85) 


If the electric field exceeds E Dreicer then the frictional drag lies to the right of the max- 
imum in Fig. 13.1. No equilibrium is possible in this case as can be seen by considering the 
sequence of collisions of a nominal particle. The acceleration due to E between collisions 
causes the particle to go faster, but since it is to the right of the maximum, the particle has 
less drag when it goes faster. If the particle has less drag, then it will have a longer mean 
free path between collisions and so be accelerated to an even higher velocity. The particle 
velocity will therefore increase without bound if the system is infinite and uniform. In real- 
ity, the particle might exit the system if the system is finite or it might radiate energy. Very 
high, even relativistic velocities can easily develop in runaway situations. 


13.5 Assignments 


1. Evaluate the integral in Eq.(13.54) and show that it leads to Eq.(13.55). Hint: Since v 
is a fixed parameter in the integral, let the direction of v define the axis of a spherical 
polar coordinate system. Let € = v’ — v and let 0 be the angle between € and v. Then 
note that (v’ y = Ẹ? + 2€-v + v? and that dv’ = dé. Express d€ in spherical polar 
coordinates and let x = cos 0. 


2. Interaction of a low density, fast electron beam with a cold background plasma. As- 
sume that the velocity of a beam of electrons impinging on a plasma is much faster 
than the velocities of the background electrons and ions. The ions have charge Z so 
that, ignoring the beam density, the quasineutrality condition is Zn; = ne. 
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(a) Show that the Rosenbluth h potential can be approximated as 


hp(v) = =r f EW a 


e J IMS] 
Men F 
vpu 
where 
1 1 1 


uO me mp 
What is the form of hp (v) for beam electrons interacting with background 
plasma (i) electrons and (ii) ions? 


(b) By approximating 


P oarl 12 
(zv: v' +o) =o (17 X +5) 


where the beam velocity v is much larger than the background species velocity 
v', show that the Rosenbluth g potential can be approximated as 


gr (v) 


ip Favia 


( 1 E) 
v+ — NP. 
v 2mp 


Hint: use symmetry arguments when considering the term involving v - v’. 


I? 


(c 


wa 


Assume that the beam velocity is z-directed and let Fr = f d?v, fr so that Fr 
describes the projection of the 3-D beam distribution onto the z axis of velocity 
space. Since v = v, and vy = vz % 0 show that it is possible to write 


MenF 
Uzu 


glv) = (» + 


Show that the Fokker-Planck equation can be written as 


hp(v) 2s 


1 E) 
— n 
Vz 2MpF 


O F mT i 
Fr 3 npag mA ðv, | T pw? } ` 
ð 


4regm?, 1 2? 3KTF 
F? p 
20v ( Tav 72 T Om PU: 


y nrg nA ð pT, 3 KTP OF? 
dream? Ov, | * pw? w3 mp Ov, |” 


Fai,e 


R 


F=i,e 


(d) Taking into account charge neutrality and v, >> vr; show that this can be recast 
as 


OFr ne nA ð | 24+Z 3 KT | 
a T 


| 
Ot Ane2m?2 ðv, v2 2vu3 me Ovz 
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(e) Show that a steady-state equilibrium can develop where, because of collisions 
with background electrons and ions, the fast beam distribution has the form 


JEZ £ 2 
Pr ~ exp ( -22m 


3KT. 


Is this consistent with the original assumption that the beam is fast compared to 
the background plasma? 


. An axisymmetric plasma has a magnetic field which can be expressed as 


1 
B= on (VY x Vo + UIV eo) 

where ¢ is the toroidal angle, Y (r, z) is the poloidal flux, and J is the current flowing 
through a circle of radius r at axial position z. 

(a) Show that the toroidal component of the vector potential is 

1 
—yų t 
z YAM 24) 


(b) Assume that the plasma obeys the resistive Ohm’s law 


Ag(r,z,t) = 


E+UxB=nJ 


and assume that the plasma is stationary so that the toroidal component is simply 


and Eq.(9.42) to show that 
r 1 
Jg == V- Vy 
9 2T Ho (i v) 


so that the toroidal component of Ohm’s law is 
rn ia 
Eg =—- v- (ave). 


Assuming classical resistivity n ~ Te 3/2 sketch the temperature dependence 
of || as given above and also sketch the temperature dependence of Ep; eicer 
as given by Eq.(13.85). For a plasma with given w and physical dimensions, 
in what electron temperature limit do runaway electrons develop (high or low 
temperature)? For a given temperature, do runaways develop with high 7 or low 
w? For a given temperature and flux, do runaways develop in a large device or 
in a small device? If the plasma density decays, will runaways develop? 


(c 
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Wave-particle nonlinearities 


14.1 Introduction 


Linear models are straightforward and rich in descriptive power because they are based 
on a powerful, integrated set of concepts and tools consisting of eigenmodes, eigenvalues, 
eigenvectors, orthogonality, and integral transforms. These concepts and tools ultimately 
depend on the very essence of linearity, namely the principle of superposition. 

While many important phenomena are well characterized by linear models there never- 
theless exist other important phenomena where the principle of superposition breaks down 
either partially or completely. The phenomenon in question becomes amplitude-dependent 
above some critical amplitude threshold and then nonlinearity becomes important. Break- 
down of the superposition principle means that modes with different eigenvalues (to the 
extent eigenvalues still exist) are no longer orthogonal and start to interact with each other. 
These interactions result from non-linear terms in the system of equations, i.e., terms in- 
volving products of dependent variables. As an example of this, consider the product of 
two modes having respective frequencies w, and w2. This product can be decomposed into 
sum and difference frequencies according to standard trigonometric identities, e.g., 


cos |(w1 + w2)t] + cos [(w1 — w2)t] 
> : 


(14.1) 


COS w1t cos Wot = 


The beat waves at frequencies wı + w2 can act as source terms driving oscillations at 
Ww + we. In the special case where w = wa, the difference frequency is zero and the non- 
linear product can act as a source term which modulates equilibrium parameters and thereby 
changes the mode dynamics. In particular, feedback loops can develop where modes affect 
their own stability properties. Another possibility occurs when the nonlinear product is very 
small, but happens to resonantly drive a linear mode. In this case even a weak non-linear 
coupling between two modes can resonantly drive another linear mode to large amplitude. 
Similar beating can occur with spatial factors ~ exp(ik - x) so that the non-linear product 
of modes with wavevectors kı and kg drive spatial oscillations with wavevectors kı + kg. 
In analogy to quantum mechanics the momentum associated with a wave is found to be 
proportional to k and the energy proportional to w. The beating together of two waves with 
respective space-time dependence exp(ik, - x—iw,t) and exp(ik2 - x—iw2t) can then be 
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interpreted in terms of conservation of wave energy and momentum, 
W3 = Wi +rWe2 
kg = k, +k. (14.2) 


where wave 3 is what results from beating together waves 1 and 2. 

Because of the large variety of possible nonlinear effects, there are many ways to cat- 
egorize nonlinear behavior. For example, one categorization is according to whether the 
non-linearity involves velocity space (Vlasov non-linearity) or position space (fluid non- 
linearity). Vlasov nonlinearities are characterized by energy exchange between wave elec- 
tric fields, resonant particles, and non-resonant particles. Fluid instabilities involve non- 
linear mixing of two or more fluid modes and can be interpreted as one wave modulating 
the equilibrium seen by another wave. Another categorization is according to whether the 
non-linearity is weak or strong. In weak nonlinear situations, linear theory is invoked as a 
reasonable first approximation and then used as a building block for developing the non- 
linear model. In strong non-linear situations, linear assumptions fail completely and the 
non-linear behavior must be addressed directly without any help from linear theory. Weak 
non-linear theories can be further categorized into (i) mode-coupling models where a small 
number of modes mutually interact in a coherent manner and (ii) weak turbulence models 
where a statistically large number of modes mutually interact with random phase so that 
some sort of averaging is required. 

An important feature of non-nonlinear theory concerns energy. Because energy is a 
quadratic function of amplitude, energy does not satisfy the principle of superposition and 
so energy cannot be properly accounted for in linear models. Thus, non-linear models are 
essential for tracking the flow of energy between modes and also between modes and the 
equilibrium. 


14.2 Vlasov non-linearity and quasi-linear velocity space 
diffusion 


14.2.1 Derivation of the quasilinear diffusion equation 


Quasilinear theory(Vedenov, Velikhov and Sagdeev 1962, Drummond and Pines 1962, 
Bernstein and Engelmann 1966), a surprisingly complete extension to the Landau model of 
plasma waves, shows how plasma waves can alter the equilibrium velocity distribution. In 
order to focus attention on the most essential features of this theory, a simplified situation 
will be considered where the plasma is assumed to be one dimensional, uniform, and un- 
magnetized. Furthermore, only electrostatic modes will be considered and ion motion will 
be neglected. Thus the plasma is characterized by the coupled Vlasov and Poisson equa- 
tions for electrons and the only role for the ions is to provide a static, uniform neutralizing 
background. It is assumed that the electron velocity distribution function can be decom- 
posed into (i) a spatially independent equilibrium term which is allowed to have a slow 
temporal variation and (ii) a small, high frequency perturbation having a space-time depen- 
dence and which is associated with a spectrum of linear plasma waves of the sort discussed 
in Section 5.2. Thus, it is assumed that the electron velocity distribution has the form 


F(z,v,t) = folv,t) + filz,v,t) + fo(x,v,t) +... (14.3) 
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where it is implicit that the magnitude of terms with subscript n is of order e” where 
e << 1. Att = 0, the terms f,, where n > 2 all vanish because the perturbation was 
prescribed to be fı at t = 0. Other variables such as the electric field will have some kind 
of nonlinear dependence on the distribution function and so, for example, the electric field 
will have the form 


E = Eo + fh, + E2 + E3 +... 
where it is implicit that the nt” term is of order e”. 


(a) t=0 initial 
E(k,t) wave 
fo(v) energy 
spectrum 
k 
K min Kmax 
nonresonant Tesonant 
particles particles 
AN l 
ymin = @!K max Vres = OK min 
(b) t> œ final 
E(k,t) wave 
fo(v) energy 
spectrum 
“colder” 
k 
plateau Kmin K max 
nonresonant resonant 
particles particles 
A l 
ymin = O/K max Ves = æl k min 


Figure 14.1: (a) At t = O the equilibrium distribution function fo(v) is monotonically de- 
creasing resulting in Landau damping of any waves and there is a wave spectrum (insert) 
with wave energy in the spectral range kmin < k < kmax. Resonant particles are shown as 
shaded in distribution function and lie in velocity range v? < v < v™2*, (b) As t — 00 


the resonant particles develop a plateau (corresponding to absorbing energy from the wave), 
the wave spectrum goes to zero, and the non-resonant particles appear to become colder. 


Since by assumption, fo(v,t) does not depend on position, it is convenient to define a 
velocity-normalized order zero distribution function 


fo(v,t) = no fo(v, t) (14.4) 
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so that 
J hoda =1. (14.5) 


This definition causes no to show up explicitly so that terms such as (e / meo) Ofo/Ov can 
be written as wd fo/Ov. A possible initial condition for fo(v,t), namely a monotonically 
decreasing velocity distribution, is shown in Fig.14.1(a). 

Spatial averaging will be used in the mathematical procedure to filter out some types of 
terms while retaining others. This averaging will be denoted by () so that, for example, the 
average of fı is 


(fi(z,v,t)) = fe filz, t) (14.6) 
where L is the length of the one-dimensional system and the integration is over this length. 
The spatial average of a quantity is independent of x and so, since fọ is assumed to be 
independent of position, (fo) = fo. (In an alternate version of the theory, the averaging is 
instead understood to be over a statistically large ensemble of systems containing turbulent 
waves and in this version, the averaged quantity can be position-dependent.) 

The 1-D electron Vlasov equation is 


? (fo + fi + fot...) -oZ (fi + f2+---) 
— (Ei + B24 -JE (h tfi+f2+..) = 0; (14.7) 


note that there is no Ofo/Ox term because fp is assumed to be spatially independent and 
also there is no Ho term because the system is assumed to be neutral in equilibrium. The 
linear portion of this equation 
Of, HOF 6s ôðfo_ 
at” ax On =A 
forms the basis for the linear Landau theory of plasma waves discussed in Section 4.5. 
Subtracting Eq.(14.8) from Eq.(14.7) leaves the remainder equation 


(14.8) 


x (fo + fo+...)4 va, (fot...) z (E2 +...) z (fo + fi + fot...) 
Cn? (fi+fa+.) = 0. 


(14.9) 


We assume that the quantities with subscripts n > 1 are waves and therefore have spatial 
averages which vanish, i.e., (fı) = 0, (E1) = 0, etc. Also, since fo is independent of 
position, it is seen that (E2 fo) = fo (E2) = 0, etc. Spatial averaging of Eq.(14.9) thus 
annihilates many terms, leaving 


Ey fi) + (Ezf1) + --] (14.10) 


where /ôv has been factored out of the spatial averaging because v is an independent 
variable in phase-space. The term (F; f1) is of order €? whereas (Hf) is of order €? and 
the terms represented by the +... are of still higher order. The essential postulate of quasi- 
linear theory is that all terms of order €? and higher may be neglected because € is small. 
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On applying this postulate, Eq.(14.10) reduces to the quasi-linear velocity-space diffusion 
equation 
ol = <* (Ey fi). (14.11) 

This implies that even though fo is of order €°, the time derivative of fo is of order €? so 
that fo is a very slowly changing equilibrium. 

The nonlinear term (E; fı) can be explicitly calculated by first expressing the perturba- 
tions as a sum (i.e., integral) over spatial Fourier modes, for example the first order electric 
field can be expressed as 


1 = ; 
E (x,t) = FC (14.12) 


This allows the product on the right hand side of Eq.(14.11) to be evaluated as 


(Fifi) = z Sel (i aeoe) (se [w Alene") 


o 1 ~ z 1 i(ktk')a 
== ak | arki(t,0) fotz f dre (14.13) 


where the order of integration has been changed in the second line. Then, invoking the 
representation of the Dirac delta function 


1 ikx 
ô(k) = z | ave (14.14) 
Eq.(14.13) reduces to 
1 2 z 
(Ex: fi) coe IrL dk E1 (—k,t) fi(k,v, t). (14.15) 


The linear perturbations Æ; and f, are governed by the system of linear equations discussed 
in Section 4.5. This means that associated with each wavevector k there is a complex 
frequency w(k) which is determined by the linear dispersion relation D(w(k), k) = 0. This 
gives the explicit time dependence of the modes, 


E,(k,t) = É (kje (14.16a) 
Alk,v,t) = filk,vje i, (14.16b) 


Furthermore, Eq.(14.8) provides a relationship between E(k) and fı (k, v) since the spa- 
tial Fourier transform of this equation is 


8) 
E T E E (14.17) 
m ~ Ov 
which leads to the familiar linear relationship 
x ~ Ofo/O 
Fiery a (14.18) 
m w — kv 


Inserting Eqs.(14.18) and (14.16a) into Eq.(14.15) shows that 


i €e ~ ~ Ofo/Ov aA ey Call 
Piao f Noms ee dk E, (—k)E,(k i[w( k)+w(k)]t. 14.1 
(Ea fa) m 1(—k) Bi, h) Ae (14.19) 
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This expression can be further evaluated by invoking the parity properties of E(k) and 
w(k). These parity properties are established by writing 


1 = A . 
Ei(z,t)= > if dk Ey (k)el#?— ive (14.20) 


and noting that the left hand side is real because it describes a physical quantity. Taking the 
complex conjugate of Eq.(14.20) gives 


1 = PA atti 
E,(z,t) = A / dk Ex (k)e ihe tio” (AYE, (14.21) 
T 
The parity properties are determined by defining a temporary new integration variable k’ = 
—k and noting that [°° dk corresponds to f°. dk’ since dk’ = —dk and the k limits 
(—oo, 00) correspond to the k’ limits (co, —oo). Thus Eq.(14.21) can be recast as 
1 1 pax 1) Wik’ atiw* (—k')t 
Ex(x,t) = = | dk’ E}(-k’)e 
2T 
1 pok ikz+iw*(— 
E fa Ët (=k)elte tio’ (Ck) (14.22) 
where the primes have now been removed in the second line. Since Eq.(14.20) and (14.22) 


have the same left hand sides, the right hand sides must also be the same. Because Æ; (k) 
and w(k) are arbitrary, they must individually satisfy the respective parity conditions 


E(k) = E{(-k) 

w(k) = —w*(—k) (14.23) 
or equivalently 

Ex(-k) = Ey (k) 

w(—-k) = —wi(k). (14.24) 


If the complex frequency is written in terms of explicit real and imaginary parts w(k) = 
w,(k) + iw;(k) then the frequency parity condition becomes 


Wr(—k) +iwi(—k) = — [w,(k) + iw;(k)]” 
—w, (k) + iw;(k) (14.25) 


from which it can be concluded that 


w,(—k) = —w,(k) 
w;(—k) wi(k). (14.26) 


The real part of w is therefore an odd function of & whereas the imaginary part is an even 
function of k. 
Application of the parity conditions gives 


(14.27) 


(HĀ (k) = Ši) k) = of 
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and 
w(—k) + w(k) = 2iw;(k) (14.28) 
in which case Eq.(14.19) reduces to 
[w| 2 
i e 1 e^ a fo 
Efi) = dk : 14.29 
(Fifi) 27D =a w — kv Ov ( ) 


Substitution of (E; f1) into Eq.(14.10) gives the time evolution of the equilibrium distrib- 
ution function, 


2 
eri (k)t 


- 2 E(k 
Ofo M [aw Ae) Of (14.30) 


Ot — mL mv w — kv ðv ` 


o 2 
The quantity |Ē, (k)| has a physical meaning that is made evident by considering the 
volume average of the electric field energy 


eo hi? 
(a) 
Sa ED) dè (se faeo) (se f aw Baeren) 
2L 21 21 


c £0 ft aren) -ijoko e L i(k+k')x 
E fafa E(k) Ey (k’^)e A dz e . (14.31) 


However, using Eq.(14.14) this can be written as 


(We) 


ma, oe E (k) E (—kye ile) +(e 
(We) TnL dk E(k) Ey(—k)e 
= J dk E(k) (14.32) 


where 5 
Elkt) = |E] em" (14.33) 
Ar 

is the time-dependent electric field energy density associated with the wavevector k. A 
possible initial condition for the distribution function and wave energy spectrum is shown 
in Fig.14.1(a); the wave energy spectrum is shown in the insert and is finite in the range 
kmin < K < kmax and the distribution function is monotonically decreasing so as to cause 
Landau damping of the waves. 

Combination of Eqs.(14.30) and (14.33) shows that the evolution of the equilibrium 
distribution function is given by 


i ge 
Ofo 2i e O pu E(k,t) Ofo 


E w— kv v` 


(14.34) 


Ot Eg m? dv 


This can be summarized as a velocity space-diffusion equation 


afo _ 3 (2a #) (14.35) 


ot Ov v 
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where the quasi-linear velocity space diffusion coefficient is 


ie? Elkt) 
Dot = dk a 14.36 
Qh com? J w — kv ( ) 


The factor i in the velocity diffusion tensor seems surprising at first because fo is a real 
quantity. However, the i factor is entirely appropriate and is intimately related to the parity 
properties of w(k) since 


i ifn) = ko] +w:(k) 
w(k) — kv [w,(k) — ku? + w2(k) (14.37) 


The denominator in this expression is an even function of k, the term [w,.(&) — kv] in the 
numerator is an odd function of k, and w;(k) is an even function of k. Since E(k,t) is an 
even function of k, integration over k in Eq.(14.36) annihilates the imaginary component 
(this component is an odd function of k) and so 


e? 2wi(k)E(k,t) 
DoL = dk A 14.38 
Or sgm? / lw, (k) — kul? + w2(k) ee 


In summary, the self-consistent coupled system of equations for the non-linear evolution 
of the equilibrium consists of Eq.(14.35), (14.38), and 


< E(k +) = 2w; (KJE (kt) (14.39) 


which is obtained from Eq.(14.33). The real and imaginary parts of the frequency w, (k), 
wi(k) appear as parameters in these equations and are determined from the linear wave 
dispersion relation which in turn depends on fo. This linear wave dispersion relation is 
obtained as in Section 4.5 by writing E; = —0¢,/Oz and then combining Eq.(14.18) with 
Poisson’s equation 


k*¢,(k) = -> = | du fi (14.40) 
to obtain 
a foo kb, ( k) 
kg 14.41 
Pil Eom — fw Ov w—kv ( ) 
which can be expressed as 
w? Afo/d 
14 E fav fof o (14.42) 
k w — kv 


using Eq.(14.4). Thus, given the instantaneous value of fo(v,t) = no fo(v, t), the complex 
frequency is determined from Eq.(14.42) and then, given the instantaneous value of the 
wave spectral energy, both the evolution of fo and the wave spectral energy are determined 
from Eqs.(14.35) and (14.39). 
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14.2.2 Conservation properties of the quasilinear diffusion equation 


Conservation of particles 
Conservation of particles occurs automatically because Eq.(14.35) has the form of a 
derivative in velocity space. Thus, the zeroth moment of Eq.(14.35) is simply 


ð ð ð fo afo]? 
5, foh = f ws (2a £) = [Pa | PE =0 (14.43) 


and so the quasi-linear diffusion equation conserves the density n = f dv fo. 
Conservation of momentum 
Examination of momentum conservation requires taking the first moment of Eq.(14.35), 


fô) 8) ð f8) 
Z om) =m favor (2a) = -m | wpa 2. (14.44) 
Using Eq.(14.36) this becomes 
2ie? E(k,t) Ofo 
zS = — d dk 
ot van) m f i com? J w-— kv ðv 
. 1 fo 
= —2iw? | dk€(k,t) Jd —. 14.4 
w f ea f J ku Ov see) 
However, the linear dispersion relation Eq.(14.41) shows that 
1 Of 
2 
i d —-=-k 14.46 
af meko: Ov ( ) 
and so 
ð 
Ai (nmu) = 2i f ake (ktk =0 (14.47) 


which vanishes because the integrand is an odd function of k. Thus, the constraint provided 
by the linear dispersion relation shows that the quasi-linear velocity diffusion equation also 
conserves momentum. 

Conservation of energy 

Consideration of energy conservation starts out in a similar manner but leads to some 
interesting, non-trivial results. The mean particle kinetic energy is defined to be 


2 
Wp = / du fo. (14.48) 


The time evolution of Wp is obtained by taking the second moment of the quasi-linear 
diffusion equation, Eq.(14.35), 


OWp pmo afo 
a 2 av C” ay 
a 
=. Tel J dunia a (14.49) 


v` 
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Using Eq.(14.36) gives 
+2 
OWp EE ee 2ie dk E(k,t) Ofo 
Ot Eom? w — kv Ov 


w— kv Ov 


Elkt —kv —w df 
dia’ fak Eo fav? Deis 


2iw(k)E(k,t 1 Of 
= -u3 fak AOE) foe. (14.50) 
Invoking Eq.(14.42) to substitute for the velocity integral results in 
ow, 
ae = fa 2i [wr (k) + iwi(k)] E(k,t). (14.51) 


Since wp (k) is an odd function of k and w;(k) is an even function of k, only the term 
involving w; survives the k integration and so 
OWp 
ot 
Using Eq.(14.39) this becomes 


+ [ak Qu i(k)E(k,t) = 0. (14.52) 


a 
a [we + facwn] =0 (14.53) 


which can now be integrated to give 


Wp + fewn = Wp + Wp = const. 


showing that the sum of the particle and electric field energies is conserved. The particle 
energy and the electric field energy therefore need not be individually conserved — only 
the sum of these two types of energy is conserved. This result allows for energy exchange 
between the particles and the electric field. 


14.2.3 Energy exchange with resonant particles 


More detailed insight is obtained by considering the role of resonant particles, i.e., those 
particles having velocity v ~ w/k as indicated by the shaded region in Fig.14.1. This is 
done by using Eq.(14.37) to re-write the top line of Eq.(14.50) as 


aw, 2e i fwr (k) — ku] +wi(k) 8 
Wr = — dk E(k.L) / duv Ss + ao J 
9,2 vwi(k) afo 
aut f akela) fav OIA (14.54) 


where because w, (k) — kv is an odd function of k, only the w; (k) numerator term survives 
the & integration. The velocity integral can be decomposed into a resonant portion which 
is the velocity range where wy ~ kv and the remaining or non-resonant portion. In the 
resonant portion, it is possible to approximate 


Wi T Wp 


=) 


(14.55) 
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while the non-resonant portion can be written as a principle-part integral. Thus, Eq.(14.54) 


becomes 
P fav vw; (k) -2h r% (32) l 
(wr — kv)” Ov k OU wily 


(14.56) 

A relationship between the principle part and resonant terms in this expression can 

be constructed by similarly decomposing Eq.(14.42) into a resonant portion and a non- 

resonant or principle part. The principle part is Taylor expanded as a function of wp + iw; 

where w; is assumed to be much smaller than w,.. This procedure is essentially the scheme 

discussed in the development of Eq.(5.81), but to emphasize the details the expansion of 

the principle part is written explicitly here. Thus, the linear dispersion relation Eq.(14.42) 
can be expanded as 


OW p 


= -už fak 2E (kt) 


k2 Wr iwi 
k k 
w2 Ofo/d Of 
Ee, Sp pel ie E Lees (Oto 
k? We W ðv v=wr/k 
k k 
w2 Ofo/d iw; ô Ofo/d Of 
= 1-2 P fa Race oP f ew ina Hir ( h) 
k OE. k A(z) UT ðv v=wr/k 
w2 Ofo/d iwi afo /ð Of 
ane p ey R 5 Wee fay ION: ae a 
k? ia en k ( 2) av an 
DE, v=wr/ 


(14.57) 


The imaginary part of the last line must vanish and so 


Wi Ofo/Ov (32) 
“pla ' =0 14.58 
k i "Wur N oar a 


which leads to the usual expression for Landau damping. 
If it is assumed that w, /k >> vr then the principle part integral in Eq. (3.238) can be 
approximated as 


v Ofy 1 afo 1 - 1 
Pd ~ d = d = 14. 
i j (wr — kv)? ðv w? if a w? J E We Cie 


r r 
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and the principle part integral in Eq.(14.58) can similarly be approximated as 


Ofo/Ov OA 
P fo 7 [ehig (v — wr / k)? 


fo 
2 [aw Gane 


k3 
© 2 (14.60) 
Using Eq.(14.59), Eq. (14.56) becomes 
OWp 2 wilk) Wr afo 
= dk 2€ (k,t 14.61 
K of «| ces ee (14.61) 
and using Eq.(14.60), Eq. (14.58) becomes 
k? Of 
Quiz +T ( k) =0. (14.62) 
w3 Ov $= Si 


Comparison of the above two expressions shows that the second term in the square brackets 
of Eq.(14.61) has twice the magnitude of the first term and is of the opposite sign. Since 
we ~ w?, this means that Eq.(14.61) is of the form 


Wp ð a f 
= — WP non-resonan = WP resonant = d?’ kE kt 2w; — 4w; 14.63 
ae = Wr, t+ Woresonant = | REÇ) Dwi — dui] (14:63) 


where the 2w; term prescribes the rate of change of the kinetic energy of the non-resonant 
particles, and the —4w; term prescribes the rate of change of the kinetic energy of the 
resonant particles. On the other hand, Eq.(14.52) showed that the rate of change of the total 
particle kinetic energy was equal and opposite to the total rate of change of the electric field 
energy. These two statements can be reconciled by asserting that the wave energy consists 
of equal parts of non-resonant particle kinetic energy and electric field energy and that the 
resonant particles act as a source or sink for this wave energy. This energy budgeting is 
shown schematically as 


| Eraiketa) + | EKER) => J Proe 


1—— m 1——— 1— 
kinetic energy energy stored in kinetic energy 
of non-resonant particles electric field of resonant particles 
N ew 
wave 
energy 


(14.64) 

Behavior of the resonant particles 
We define fres as the velocity distribution of the resonant particles, i.e., the particles 
with velocities v ~ w/k for which €(k,t) is finite. Since the velocity range v = w(k)/k 
of the resonant particles maps to the spectrum €(k,t), the upper and lower bounds of the 
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resonant particle velocity range respectively map to the lower and upper bounds of the 
values of k for which €(k,t) is finite. For these particles, the delta function approximation 
Eq.(14.55) can be used to evaluate the quasilinear diffusion coefficient given by Eq.(14.38) 
and obtain 


2 
Donres(¥) ~ m / dk 5(w, — kv) E(k,t) 
Qre? E(k,t) 
R f&s- 7 
Z 2re? E(w,/v, 1) (14.65) 
Eom? v : 


Using this coefficient the quasilinear velocity diffusion for the resonant particles is 


Ofo.res Qre? O (E(wr/v,t) afores 
: = ; ; 14.66 
ot Eom? Ov v Ov ( ) 
It is seen from Eq.(14.62), the generalized formula for Landau damping, that 
ð k? 
T ($) = 2wi ~z No (14.67) 
V Jy wr/k Wr 
and so Eq.(14.66) becomes 
forres E( Ca te oe . 
ot 7 om 5 Yi $ 
_ wp O Sto t) 5 l 
~ mð me 
2wp O O a 
~ m ot dv (F v3 proB 


where wr = Wp = y/no e? /egm has been used and also, because the particles are resonant, 
v ~ w,/k. This can be integrated with respect to time to obtain 


Quy ð (Gabe fed) (14.69) 


fo,res (v,t) foyres(v, 0) = 


m Ov v3 


By definition €(w,./v,t) vanishes for v lying outside the resonant particle velocity range 
umin < y < vma* because outside this range there is no wave energy with which the 
particles can resonate (see Fig.14.1). Hence, integration of Eq.(14.69) over the velocity 


max 


range ures < v < VRA~ of the resonant particles gives 


max max 


Ures Ures 
f dv fores (v, t) = i dv fo,res(v, 0) (14.70) 
ones ures 
which shows that the number of resonant particles is conserved. 

Equation (14.64) showed that the resonant particle energy is not conserved and can be 
exchanged with the wave energy. Thus, the zeroth moment of the resonant particles is 
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conserved, but the second moment is not. Change of the resonant particle energy while 
conserving their number is achieved by adjusting the number of resonant particles that 
are slightly slower and slightly faster than w/k. For example, if there is a decrease in the 
number of resonant particles having v ~ (w/k)_ there must be a corresponding increase in 
the number of resonant particles having v ~ (w/k)+. This process provides a net transfer 
of energy from the wave to the resonant particles, involves wave Landau damping, and 
requires having Ofo,res/Ouv < 0. The result, increasing the number of resonant particles 
having v œ (w/k), while decreasing the number having v ~ (w/k)_ flattens fo res(v) 
and so makes it plateau-like as indicated in Fig.14.1(b). 

Behavior of the non-resonant particles 

The quasi-linear diffusion coefficient for the non-resonant particles comes from the 
principle part of the integral in Eq.(14.38), i.e., 


2 Qu (KE (kt 
Datnon-res = —— p far wil) ( van (14.71) 
Eom [ur (k) — kv] + w?(k) 


The vast majority of the non-resonant particles have velocities much slower than the wave, 
so for the non-resonant particles it can be assumed v << wr /k in which case 


e? fa 2w;(k)E (k.t) 


D@QL,non—res 


Eom? w2(k) 
1 
~ f dk 2w; (k)E(k,t) . (14.72) 
mno 


Since this non-resonant particle velocity space diffusion coefficient is velocity-independent, 
it can be factored from velocity integrals or derivatives. Equation (14.11) showed that the 
change in fo is order €? where e << 1. This means that changes in fo are small compared 
to fo. On the other hand, there is no zero-order wave energy since the wave energy scales 
as E? and so is entirely constituted of terms which are order €?. Thus, the wave energy 
spectrum can change substantially (for example, disappear altogether) whereas there is a 
only slight corresponding change to fo. Thus, Eq.(14.35), becomes 


Ofo non—res 1 o O fo non—res 
= dk 2w;(k)E (k t) oes 
Ot Mno a BE ee) Ov 
1 d 2? fo non—res 
— | dk&(k,t) ) ———— 14.73 
mno ca ( D) Ov? ( ) 


which can be integrated to give 


2 
NEN E E E Cn) A ( / dk E(k) = £(4.0)]) A fo,non-res 


Mno Ov? 
(14.74) 
Since the number of resonant particles is conserved, the number of non-resonant particles 
must also be conserved. 
If an initial wave spectrum becomes damped at t = oo then it is possible to write 


2 
[fo Eo) ə D o 
(14.75) 


fo,non—res (v, co) = foRon ares (v,0) = 
mno 
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In the range of velocities where 0? Tomon=res). ðv? > 0 there will be a decrease in the 
number of non-resonant particles and vice versa in the range where 8? dO men ress, Ov? <0. 
An initially Maxwellian distribution (which has 0? fononareal Ov? < 0 for very small 
velocities and vice versa for very large velocities) will develop a double-plateau shape 
(high plateau at low velocities and low plateau at high velocities) with a sharp gradient 
between the two plateaus. The lower plateau will merge smoothly with the plateau of the 
resonant particles. The non-resonant portion of the velocity distribution will appear to 
become colder as the waves damp as indicated in Fig.14.1(b). 

This apparent cooling can be quantified by introducing an effective temperature Te ș p 
which is defined to have the time derivative 


d 2 d 
dt (KTe pf) = Ty dt facwn. (14.76) 


This expression can be integrated to obtain 


2 
KT ep p(t) = KT + — i, dk E(k,t) (14.77) 
no 


where To is the temperature for the situation where there are no waves. 
Using Eq.(14.76), Eq.(14.73) can be written as 


o d KTeff 3? fo non—Tres 
BD, J0,non—-res = : . 14.7 
ail. dt ( 2m ) Ov? eae) 
If fo,non—res is considered a function of KT. 7 instead of t, Eq.(14.78) can be written as 
O 1 2? non—-Tres 
1 fomon—res (14.79) 


AeTa py) omens = Sy ape 


which has the appropriately normalized solution 


f m mv? 
non—res — 1 ex 
0, i 2nKTeff (t) a 2KTeff (t) 


m/20 mv? /2 


J exp 5) 
kTo + — f dk E(k,t) kTo + — f dk E(k,t) 
no no 


no 


(14.80) 


Thus wave damping [i.e., the reduction of €(k,t)] corresponds to an effective cooling of 
the non-resonant particles. As shown in Eq.(14.64) this kinetic energy reduction is accom- 
panied by an equal reduction in the electric field energy and all this energy is transferred to 
the resonant particles. 


14.3 Echoes 


Plasma wave echoes (Gould, O’ Neil and Malmberg 1967, Malmberg, Wharton, Gould and 
O’Neil 1968) are a nonlinear effect that provide some very useful insights into the Landau 
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damping mechanism and raises some interesting questions about how Landau damping re- 
lates to entropy. This arises from consideration of the following two apparently conflicting 
observations: 
1. Wave damping should destroy the information content of the wave by converting or- 
dered motion into heat. Thus, wave damping should increase the entropy of the sys- 
tem. 


2. The collisionless Vlasov equation conserves entropy. This is because collisions are 
the agent that increases randomness and hence entropy. A collisionless system is in 
principle completely deterministic, so that the future of the system can be predicted 
with complete precision simply by integrating the system of equations forward in time. 
This entropy-conserving property of the collisionless Vlasov equation can be seen by 
direct calculation of the rate of change of the entropy, 


= = g Je ferenn In f(z, v, t) 
= ea 
= - fav fav mfa ) (ve + See) 
= “fee fe Carn ot ie 2) gms- f) 
= (14.81) 
since 
of of 
[oum = o faz mt- 148 


So, what really happens- does Landau damping increase entropy or not? The answer 
goes right to the heart of what is meant by entropy. In particular, it should be recalled 
that entropy is defined as the natural logarithm of the number of microscopic states corre- 
sponding to a given macroscopic state. The concept “macroscopic state” presumes there 
exist macroscopic states which are (i) composed of different microscopic states and (ii) for 
all intents and purposes indistinguishable. Collisions would cause the system to continu- 
ously evolve through all the various microscopic states and an observer of the macroscopic 
system would not be able to distinguish one of these microscopic states from another. 

The paradox is resolved because the concept of many microscopic states mapping to a 
single macroscopic state fails for Landau damping since the physical system for the Landau 
damping problem is actually in just one well-defined state which can be calculated. The 
macroscopic state therefore maps to just one microscopic state and so the system does not 
continuously and randomly evolve through a sequence of microscopic states. 

Landau damping does not involve turning ordered information into heat. Instead, macro- 
scopically ordered information is turned into microscopically ordered information. The in- 
formation is still there, but is encoded in a macroscopically invisible form. A good analogy 
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to this situation is the process of making a holographic image of an object. The hologram 
looks blank unless illuminated in such a way to provide the proper decoding. If an observer 
does not know how to do the decoding, the information is lost to the observer, and the en- 
tropy content of the hologram is high compared to a regular photograph. However, if the 
observer knows how to decode the microscopically stored information, then no information 
is lost. 

The Landau damping process scrambles the phase of the macroscopic information and 
encodes this as microscopic information which is normally irretrievable. If one does not 
know the appropriate trick for unscrambling the microscopically encoded information, it 
would be tempting to state that entropy has increased. However, the appropriate trick exists 
and has been demonstrated in laboratory experiments. Information that appeared to have 
been lost is retrieved and so entropy is not increased. The trick involves unscrambling the 
information encoded in the well-defined microscopic state. 

To get an idea for the issues involved, we first consider the simple problem of a one di- 
mensional beam of electrons having initial velocity vo. The beam is transiently accelerated 
by an externally generated, spatially periodic electric field pulse Æ = E cos(kx)6(t) where 
E is considered to be infinitesimal. The equation of motion for the electron beam is 


Ov Ov = 
m (F + oF) = —eF cos(kx)6(t) (14.83) 
and linearization of this equation gives 
vı i Ov, = — 
m( AE tag ) = —eE cos(kx)d(t). (14.84) 


It is convenient to use complex notation so that cos kz — e'*” and it is understood that 
eventually, the real part of a complex solution will be used to give the physical solution. It 


is therefore assumed that vı ~ e** and so the linearized equation becomes 


a (F + ikugv, ) = —eFô(t). (14.85) 


Next it is argued that since the delta function can be considered as the superposition of an 
infinite spectrum of harmonic oscillations, i.e., 


1 ; 
s(t) = > J et dy (14.86) 


the response of the beam to each frequency component can be considered. Thus, we con- 
sider the equation 


Ody E 
EET a ae eee (14.87) 
oe Ot m 
so that the net velocity is 
1 
v(t) = yf tiles. (14.88) 


Note that Č; (w, t) is not a Fourier transform because it contains the explicit time depen- 
dence. 

Since vo is the initial beam velocity, vı (t) must vanish at t = 0 providing a boundary 
condition for Eq.(14.87) at t = 0. One way to solve this equation is to first assume that 
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01 (w, t) consists of a particular solution satisfying the inhomogeneous part of the equation 
(i.e., balances the driving term on the right hand side) and a homogeneous solution (i.e., a 
solution of the homogeneous equation (left hand side of Eq.(14.87))). The coefficient of the 
homogeneous solution is chosen to satisfy the boundary condition at t = 0. The particular 
solution is assumed to vary as e'**~'*¢ and so is the solution of the equation 


eE 


(—iw + ikvo) ŭi = ——e™. (14.89) 
m 
The homogenous solution is the solution of 
0b, 
LA A koors 0 (14.90) 


ot 

and has the form 6? = \exp(—ikvot) where A is a constant to be determined. Adding the 
particular and homogeneous solutions together gives the general solution 

eE ie it 


m (w— kvo) 


= H Ae ikot (14.91) 


where A is chosen to satisfy the initial condition. The initial condition vı = O att = 0 
determines A and gives 


ieE (e= — e` ikvot) 


m (w — kvo) 


O1(w,t) = (14.92) 
as the solution which satisfies both Eq.(14.87) and the initial condition. The term involving 
e ‘vt is called the ballistic term. This term contains information about the initial condi- 
tions, is a solution of the homogeneous equation, is missed by Fourier treatments, is incor- 
porated by Laplace transform treatments, and keeps vı from diverging when w — kv — 0. 

If we wished to revert to the time domain, then the contributions of all the harmonics 
would have to be summed, giving 


ieE Ge — e` ikvot) 


t) == —— De 
v(t) 2am : (w — kvo) 


(14.93) 


14.3.1 Ballistic terms and Laplace transforms 


The discussion above used an approach related to Fourier transforms, but added additional 
structure to account for the initial condition that vı = 0 att = 0. This suggests that Laplace 
transforms ought to be used, since Laplace transforms automatically take into account ini- 
tial conditions. Let us therefore Laplace transform Eq.(14.87) to see if indeed the particular 
and ballistic terms are appropriately characterized. The Laplace transform of Eq.(14.85) 
gives 


eE [® , 

põ + ikvoŭı = —— dt e` iwt—pt 
m Jo 
eE 1 


hun (14.94) 
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so = 
E 
Õõ = 7 a dk : . (14.95) 
(p + iw) (p + ikvo) 
The inverse Laplace transform gives 
: 1 ep p% ert 
= (14.96) 


anim Jaio (p+ be) (p+ ikvy) 
Analytic continuation allows the contour to be completed on the left hand side and it is 
seen that there are two poles, one at p = —iw and the other at p = —ikv. Evaluation of the 
residues for the two poles gives 


ert 
= 277i lim + IW) x - 
a= -LĒ pay O e 
271m S . eP 
2 l Hik ; ; 
eNA pa Siki R (p + iw) (p + ikvo) | 
E —iwt —ikvot 
= £2 4 E (14.97) 

m (—iw + ikvo) (—ikvo + iw) 


which is the same as Eq.(14.92). The ballistic term ~ e~'*%* thus results from a pole 
originating from the Laplace transform of the source term, whereas the homogeneous term 
~ e results from the pole originating from the factor (p + ikvg) that appeared in the 
left hand side of Eq.(14.94). 


14.3.2 Phase mixing of the ballistic term for multiple beams 


Now suppose that instead of just one electron beam, there are multiple beams where 
the density of each beam is proportional to exp(—vg/v7.); this would be one way of 
characterizing a Maxwellian velocity distribution. Superposition of the ballistic terms from 
all these beams leads to a vanishing sum, because the ballistic terms each have a velocity- 
dependent phase and so the superposition would give destructive interference due to phase 
mixing. In particular, the superposition would involve integrals of the form 


272 4% 229 . 2 2,2 4.2 
J aroeira = ek upt Hf dune (voles Host/2) = rure" opt” /4 


(14.98) 
which would quickly become extremely small. This suggests the ballistic term has no en- 
during macroscopic physical importance and, in fact, this is true for linear problems where 
the ballistic term is typically ignored. However, the ballistic term can assume importance 
when nonlinearities are considered. 


14.3.3 Beam echoes 
The linearized continuity equation corresponding to Eq.(14.84) is 
ə On, ð 
z vt ingot =0 (14,99) 
Ox 


and so, invoking the assumed e'** dependence, this becomes 


a 
S Eikon tiknir = 0. (14.100) 
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In analogy to Eq.(14.88) the linearized density can be expressed as 


ni(t) = z [aena (14.101) 


where again ñ (w, t) is not a Fourier transform because ñı(w,t) contains a ballistic term 
incorporating information about initial conditions. The equation for each frequency com- 
ponent thus is 


On. 
a + ikuofiy + iknod, = 0 (14.102) 
which may be Laplace transformed to give 
(p + ikvo) ñi + iknoŭı = 0 (14.103) 
or, using Eq.(14.95), 
iknoeE 
ñ (p,w) = ange a, (14.104) 


(p + iw) (p + ikvo)” 
There is now a second-order pole at p = ikvo and so there will be a ballistic term ~ 
exp(ikuot) associated with the density perturbation. This ballistic term will also phase mix 
away if there is a Gaussian velocity distribution of beams. 
In order to consider non-linear consequences, we consider the second-order continuity 
equation 


ane on uy an 2 (nv) =0 (14.105) 
which has inhomogeneous (forcing) terms such as nv involving products of linear quanti- 
ties. Suppose that in addition to the original pulse with spatial wavenumber k at time t = 0 
an additional pulse is also imposed with wavenumber k’ at time t = 7. This additional pulse 
would introduce ballistic terms having a time dependence ~ exp(ik'vo (t — T)). Thus the 


nonlinear product nv; has a dependence 


nıvı ~ Reexp(ikuot) x Reexp(ik'vo (t — T)) (14.106) 


which contains terms of the form exp(ikuot — k'vo (t — T)). In general, this product of 
ballistic terms would phase mix away if there were a Gaussian distribution of beams, just 
as for the linear ballistic term. However, at the special time given by 


kt — k! (t —T) =0 (14.107) 


the phase of the nonlinear ballistic term would be zero for all velocities, and so no phase 
mixing would occur when the velocity contributions are summed. Thus, at the special time 
kit 
t= —— 14.1 

ki -k Gee 
the non-linear product is not subject to phase-mixing and the superposition of the non-linear 
ballistic terms of a Gaussian distribution of beams gives a macroscopic signal. The time 
at which the phase of the non-linear ballistic term becomes stationary can greatly exceed 
T and so a macroscopic nonlinear signal would appear long after both initial pulses have 
gone. This ghost-like non-linear signal is called the fluid echo. 
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14.3.4 The self-consistent non-linear Vlasov-Poisson problem 


These ideas carry over into the Vlasov-Poisson analysis of plasma waves, but several ad- 
ditional issues occur which complicate and obscure matters. First, the problem takes place 
in phase-space so instead of having a pair of coupled equations for velocity and density, 
there is only the Vlasov equation. The Vlasov equation not only contains a convective 
term analogous to the fluid convective terms but also incorporates an acceleration term 
which describes how the velocity distribution function is modified when particles undergo 
acceleration and change their velocity. As in the fluid equations there is a coupling with 
Poisson’s equation. This coupling not only provides the self-consistent interaction giv- 
ing plasma waves, but it also provides for Landau damping. Landau damping is essen- 
tially a phase mixing of the Fourier-like driven terms, each of which scales as f du(w — 
kv)~! exp(—iwt)0fo/Ov. However, linear ballistic terms must also be excited in order to 
satisfy initial conditions. These ballistic terms scale as f du(w—kv)~! exp(—ikvt)Ofo/Ov 
and also phase mix away because of the exp(—ikvt) factor. 

If there are two successive pulses, then the non-linear ballistic term will again have a sta- 
tionary phase (i.e., phase independent of velocity) at the special time given by Eq.(14.108). 
This time could be arranged to be long after the linear plasma responses to the two pulses 
have Landau damped away so that the echo would seem to appear from nowhere. Thus, 
the Vlasov-Poisson analysis contains essentially similar echo physics, but in addition has a 
self-consistent treatment of the plasma waves and their associated Landau damping. 

To proceed with the Vlasov-Poisson analysis, we begin by considering Eq.(14.7) again, 
which is rewritten below for convenience 


8) 


ð 
z Vo a ie a oe v (fo + fi + f2 +...) 
e o 
(Eo + E1 + E2 +.) — (fo + fı + f2 +...) = 0. (14.109) 
m Ov 
Equilibrium is defined as the solution obtained by balancing all the zeroth order terms, 
i.e., 
ð 8) ð 
fo , ,Pfo _ A Ofo _ 9 (14.110) 


Ot Ox m Ov 
This is trivially satisfied by having Eo = 0, Ofo/Ot = 0, Ofo/Ox = O and fo(v) arbitrary; 
we will make these assumptions here. This equilibrium solution is then subtracted from 
Eq.(14.109) leaving 


ð ð e ð 
a tt + fo+..)4 v5, i + fo+..) mE (Jo Efi fa Fi) 
e ð 
E> +.. y+.) = 0. 
£ (E+) = (fot fit-) 
(14.111) 
The first-order solution is defined as the solution to 
ð, fi e n Ofo _ 
aT! Hv Əz me Av = 0, (14.112) 


the equation obtained by retaining all the first-order terms. The first-order solution is then 
subtracted from Eq.(14.111) and what remains are second and higher order terms. Drop- 
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ping terms higher than second-order gives the second-order equation 


oh, 2h = oho — By oh =0. (14.113) 
The special case where Æ; fı has a zero frequency component was discussed in the previous 
section on quasilinear diffusion and so here only the situation where Æ fı has a finite 
frequency will be considered. Since the first-order system has been solved, the last term in 


Eq.(14.113) can be considered as a source term and so we re-arrange the equation to be 


Of eK, Ofo _ € O 

Ot Ai Ox m æ mov 

where Æ; and fı are known and fo is to be determined. The specification that f = fo + fi 

at t = 0 provides the boundary condition fọ = O at time t = 0. Because Eq. (14.114) 

has both a self-consistent electric field contribution (/ term on the left hand side) and a 

prescribed electric field term (E; term on the right hand side), it has aspects of both the 
self-consistent problem and of the problem where the electric field is prescribed. 


(Ei fı) (14.114) 


The linear problem Suppose periodic grids are inserted into a plasma and the grids are 
transiently pulsed thereby creating the potential ¢,,,, (x,t). The charged particles will move 
in response to this applied potential and the resulting displacement produces a perturbation 
of the plasma charge density. Poisson’s equation must therefore take into account both the 
charge density on the grid and the resulting plasma charge density and so has the form 


1 
(grid charge density + plasma charge density) . (14.115) 


V°¢o, = -> 
dy a 


Since the grid charge density is related to the grid potential by 


1 
V beat = —— (grid charge density ) (14.116) 
E0 
Poisson’s equation can be recast as 
2 2 € 
VA ENV beat + — fra (14.117) 
E0 
A Fourier-Laplace transform operation is then applied to Eq.(14.117) to obtain 
-kJ = khen t — fio. (14.118) 
E0 
However, using Eı = —V¢, the Fourier Laplace transform of Eq.(14.112) gives 
z __.€ kh H 
on 14.11 
fi 'mp +ikv Ov ( A 
so that Eq.(14.118) becomes 7 
7 Deut (p, k) 
(p: k) = yp (14.120) 
P= Dipa) 


where 


e? k a fo 
D(p,k)=1 f; T dv (14.121) 
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is the usual self-consistent linear dielectric response function. Thus, the driven first-order 
velocity distribution function can be written as 


Î = a k PerilP, k) Ofo 
i mp+ikv D(p,k) v` 


(14.122) 


If an inverse transform were to be performed on rie then three issues would have to be 
taken into account. These are (i) a ballistic term ~ exp(ikvt) associated with the pole 
due to p + ikv in the denominator, (ii) a Landau-damped plasma oscillation due to the 
root of D(p, k) in the denominator, and (iii) the direct inverse transform of the numerator 
Deut (D> k) which by itself would reproduce the original imposed potential, but which will 
be modified due to the other factors. 


Fourier-Laplace transform of the non-linear equation As discussed in Eq.(14.1) 
nonlinear quantities provide sum and difference frequencies. For example, an oscillation at 
frequency w would result from the nonlinear product of a term oscillating at w — w’ and a 
term oscillating at w’ since (w — w’) + w’ = w. This can be written in a more formal and 
more general way using convolution integrals for both Fourier and Laplace transforms. 

Since Laplace transforms will be used for the temporal dependence and since the right 
hand side of Eq.(14.114) involves a product term, it is necessary to consider the Laplace 
transform of a product, 


L£(g(t)h(¢)) 


os g(t)h(t)e dt 
0 


oo 1 b+ice ; 
Í z G(p')e? tdp' h(t)e ”*dt 
0 1 Jb 


1 b+ioo oo o Je 

= g(p' h(t)e\” ~P)*dt | dp’ 
=f atv’) (| (i)e ) D 
1 b+ioo 7 

= ae 9(p')h(p — p')dp’. (14.123) 
T1 Jb—ioo 


This means that the Laplace transform with argument p results from all possible products 
of g(p') with h(p — p’). If exp(at) is the fastest growing term in g(t) and exp(at) is the 
fastest growing term in h(t) then b > a; is required in order for the Laplace transform of 
g(t) to be defined. Furthermore, in order for the Laplace transform of h(t) to be defined, it 
is necessary to have Rep — Rep! > a2 or Rep > Rep’ + az which implies Rep > b+ ag. 
These requirements can be summarized as Rep — ag > b > ay. 


Using 
gk) = i, g(x) e~ ** da (14.124a) 
l p +ikax 
ola) = J(k) eT '** dk, (14.124b) 


—co 
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a similar procedure for Fourier transforms gives 


F (g(a)h(x)) 


i g(x)h(a)e** da 


—co 


= T E E g(k'je t” an h(x)el*" dex 
z a dk'ğ(k ii h(x) lt) rar 


Sug AKIK- k’). (14.125) 


Thus, the Fourier-Laplace transform of Eq.(14.114) gives 


(p + ikv) fo(p,k) + rég pk k) 


dk! tise q 
w ia. oP wy (p kEi- pk- | . (14.126) 


Because the convolution integrals are notationally unwieldy, to clarify the notation we 
define the new dummy variables 


m a 


ae oe 
p = p-p (14.127) 
so that Eq.(14.126) becomes 
ee Ofo ~ 
(p +ikv) f2(p, k) + ik— hlp, k) 


dk' b+ico dp' eth 
a A ee a dı (p',k')fi(p',k ] . (14.128) 


The factors in the convolution integral can be expressed in terms of the original driving 
potential using Eqs.(14.120) and (14.122) to obtain 


A o 
(+ itv) lp,k) +i 2205, op, k) = Zp, ko) (14.129) 


where the non-linear convolution term is 


ear = ey af p Peat P's’) 
b D(p',k’) 


ik! beae(D' k’) Afo (14.130) 


pl +ik'u D(p',k'!) Ov 
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Equation (14.129) may be solved for f2(p, k) to give 


P oe sE k Ofo ~ 1 (6) 
hp. k) = m (p+ikv) ð y P20, le (p + ikv) Ov au* 


However, the Fourier-Laplace transform of the second-order Poisson’s equation gives 


x(p, k,v). (14.131) 


—k?bo(p,k -.f folp,k (14.132) 


and so substituting for fo(p, k) gives 


—kbo(p,k) = = || a a Oua ppa Ee dv (14.133) 


m (p + ikv) dv (p +ikv) Ov 

or 

z e 1 Ox 

k) = d 
dalp, k) k?eoD (p, k) lem av 
e ikX 
7 dv. (14.134) 
k?eoD (p, k) i (p + ikv)? 

Substitution for x and using the velocity-normalized distribution function fo = fo/no 


discussed in Eq.(14.4) nai 


3 dk' b+ico v 
k) = SEN 
ben = pee fav ff 


= ik k' Dent (D' ,k' ) ik’ os k') fo f 


(p+ikv)? D(p',k') pl +ik'v D(p',k') dv 
(14.135) 


Double-impulse source function and its transform 

In order to proceed further, the form of the external source must be specified. We 
assume that the external source consists of two sets of periodic grids which are pulsed 
sequentially. The first set of grids has wavenumber ka and is pulsed at t = O whereas the 
second set of grids has wavenumber ky and is pulsed after a delay 7. Thus, the external 
source has the form 


Pertl 2, t) = bg cos (kax) E(wpt) + Q cos (kox) 6(wp(t — T)). (14.136) 
The Fourier-Laplace transform of this source function gives 


Pezi(P, k) 1 m Pertz, t) er- Ptdrdt 
—co JO 
= LY {p(k ka) + by5(K+ ky) PT}. (14.137) 


Wp as 


Since we are interested in the non-linear interaction between the a and b pulses, only 
the contribution from the a pulse in the first ¢ṣ,+ factor in Eq.(14.135) and the contribution 


ext 
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from the b pulse in the second @,»; 
substitute the a contribution in Eq.(14.137) for the first 


factor in Eq.(14.135) will be considered. We therefore 
factor in Eq.(14.135) to obtain 


ext 


p WpTQg dk’ pon ae ikk'5(k! + ka 
ben = ieo ES a 
k?D(p,k) m b (p + ikv)? D(p', k') 


x ik' dent ((P — p'),k k') ô fo 
(p-p')+ik'v D(p-p',k') æ` 


(14.138) 


The effect of the 6(k’ + ka) factor when evaluating the k’ integral is to force k' + +ka 
and also k’ — k = ka so that 


b+ico zi 
7 7 i dp’ (Fike) 
2 (p, k) a k2D (p, k) 2 maja of 100 2T ( (p + ikv)? D(p', Fka) i 
i (k F ka) Geat(D', (k F ka)) 3 fo 
pi +i(k z= Ka Jv oe ka) ðv ` 


(14.139) 


Now let us substitute for the second Q 
obtain 


: S RTI a (Fikka) 
Pma = TEDEH k)m m ajel T (p+ ikv)? D(p', Fka) 


ext factor using the contribution from the b pulse to 


i (kF ka) (k F ka £ ke) e P7 afo 
7 +i(k F ka)v D(p', k F ka) ðv ` 


(14.140) 


The upper choice of the + and F signs is selected and the inverse Fourier transform per- 
formed to obtain 


b+ico 1 : 
~upper gp = kka 
ao (p,2) = ae o Qao J ak fav f (—i ) 
4k? D(p, k )m b T (p +ikv)” D(p', —ka) 
y i (k— ka) 6(k — ka + kp) eP 'T+ikg afo 
pi+i(k — ka) v D(p',k — ka) Ov 
_ baby = fe =f: dp! 
A (ka — ke)? D(p, ka — kp) M b 
i(ka oa kp) ka kp —p' T+i(ka—ky)a afo 
(p +i (ka — kob) v)? D(p', —ka) P' Sa DG =k) wv 
(14.141) 


The lower choice of the + and + signs means that ka — —ka and ky — —kp and so at 


~lower 
the end of the calculation ¢ (p, x) can also be determined by simply letting ka —> —ka 
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and kp — —ky. The inverse Laplace transform gives 


gore ay = Pad e b kios dp fæ of 2 dp’ 
E i A (ka — ke)? M Joico a. — kp) $ 


% Ka i kp 
(p +i (ka — ke) v)? D(p', —ka) P — p' — ikov 
ePt—(p—P') T+i(ka— ko) afo 
D(p—p',—ky) v 


(14.142) 


We first consider the p’ integral and only retain the ballistic term due to the pole (p — p’) — 
ikpv. Evaluating the residue associated with this pole gives p/ = p — ikyv so 


b+ice 


~ upper Pad e 1 dh 
Eaj SS: a aE d= |w 
$a (ta) 4 (ka — ko)? M Jo—ioo PD(p, ka — ke) 
i (ka _ ky) i kakp ePt—ikput+i(ka—ky)x afo 
(p +i (ka — ko) v)? D(p — ikev, —ka)  D(ikwv, =ke) Ov 


(14.143) 


The pole at (p + i (ka — kp) v)? is second-order and so the rule for a second-order residue 
should be used, i.e., 


1 d 
dp—-; g(p) = mi lim —g(p (14.144) 
fea pnt) = Mh G00 
Thus, we obtain 
Ga vent = fav 
txr) = 
Pal ) 4 (ka — kp)” m oa ka )v dp 
i (ka = kp) i kako ert- ikpuT+i(ka—kp)x afo 
D(p, Ka m kp) D(p = ikyv, —ka) D(ikw, — kp) Ov 
(14.145) 


The strongest dependence on p is in the exponential e” and so retaining only the contribu- 
tion of this term to the d/dp operator gives 


~upper Tipa O) 
Q2 (t,x ) = Ss te £ few 
4 (ka — kp)? m pe kau 
ti (Ka 2 ko) i kakp ePt—ikeut+i(ka—ky)x afo 
D(p, ka — ko) D(p — ikv, —ka) D(ikwv, =ke)  @v 


E 7 kako : Pafe eita- f ay 
(ka as kp) m 
i (ka — kp) t ei(ko— ka )ut—ikyut afo 
D (ilke — ka W, ka — ky) D(—ikav, —ka) D(ikpyv,—ka) v` 
(14.146) 
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~lowe 


The 6°” (t,x) term is obtained by letting ka — ky > — (ka — ky) and so 


~ upper ~lower 


palt, 2) =d (t,2)+¢  (t,2). (14.147) 


1/ 


If it is assumed that fo = (nvr) 1/7 exp(—v? /v2.) then the velocity integrals in the upper 


and lower terms will be 


1 . 
z | ewe v°/vpsil(ke—ka)t—ker}y = exp (- [(ks — ka)t — ker]? 03/4) 


2 
ke (kp = ka)? v2 
exp ( b ik, | 1 ; 


(14.148) 


Phase mixing due to the extreme velocity dependence of the exp(i(k, — kajut — ikyvT) 
factor will cause the velocity integral to vanish except at the special time 
kp 
z 
ky — ka 


techo = (14.149) 


when there is no phase mixing and so a finite by (t,x) signal results. This is the echo. The 
half-width of the echo can be determined by writing Eq.(14.148) as 


oe | ave 0° /vptil(ke—ka)t—kot]y — @~ (ttecno)” /(At)” (14.150) 


where 
2 


a 
|ko — kalur 


(14.151) 
is the width of the echo. 


14.3.5 Spatial echoes 


Creating spatially periodic sources with temporal delta functions is experimentally more 
difficult than creating temporally periodic sources with spatial delta functions. In the latter 
arrangement, two spatially separated grids are placed in a plasma and each grid is excited at 
a different frequency. This system is then characterized by a Fourier transform in time and 
a Laplace transform in space so that the convective derivative in the linearized Vlasov equa- 
tion has the form —iw f;+v0f, 0x giving ballistic terms proportional to exp(iw /v) instead 
of proportional to exp(—ikvt). Thus, if two grids separated by a distance L are excited at 
respective frequencies w; and w2, one grid will excite a ballistic term ~ exp(iwı (x — L) /v) 
and the other will excite a ballistic term ~ exp(iw22/v) and so the non-linear product of 
these two ballistic terms will include a factor ~ exp(iw (a — L)/v — iwg2/v)which will 
phase mix to zero except at the spatial location 


wil 


ee (14.152) 
Wy — W2 


Techo = 


If the spatial Landau damping length of the two linear modes is much less than Zecho, 
then the linear modes will appear to damp away spatially and then the echo will appear 
at £ = Techo Which will be much further away. This sort of arrangement was used to 
demonstrate echoes in lab experiments by Malmberg et al. (1968) 
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14.3.6 Higher order echoes 


If one expands the Vlasov equation to higher than second-order, then higher order products 
of ballistic terms can result. For spatial echoes high order products of the form 


N M 
ee-e] % [epe] (14.153) 
will appear and will phase mix away except at locations where 
Nw(x — L) = Muox (14.154) 


so there will be a higher order echo at the location 


MwL 
Zecho Zw ap’ 
Nw, — Mw 

14.4 Assignments 


1. Competition between collisions and quasilinear diffusion 


(a) Let vre = \/2KT./m,. and w = v,/vre and show that the Fokker-Planck 
equation in problem 1 of Chapter 13 can be written in dimensionless form as 


OFr ð (p 2+Z 3 ôFr 
ðr Ow\ T w | 4w3 Ow)” 


where 7 = vt and 
neet nA 
= Imega 
ATES MEVo 
is an effective collision frequency. 


Now show that the quasilinear diffusion equation can similarly be written in 
dimensionless form as 


ðFr _ D OF r 
ôr aw” aw 
where 
_ Det 
D = 
wy) Deol 
and 
Dan 2vKTe 
m 


e 
is an effective collisional velocity-space diffusion coefficient. 


(b) Suppose that both collisions and quasilinear diffusion are operative so that 


ðr Ow 


Dor ðw ` g: 4w? dw 


ðFr ð - Fr 2+Z 3 OFT 
w2 : 
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Show that the steady-state solution to this equation has the form (Fisch 1978) 


A 2+Z 
Fr ~ exp -f dw Ct Au 
(Dar + i) 


Sketch Fy for the cases where (i) De L >> 1 over a certain range of velocities 
(i.e., where wave spectrum is resonant with particles) and (ii) Do, << 1. How 
could this be used to drive a toroidal current in a tokamak? Using the appro- 
priate form of Doz show that the normalized quasilinear diffusion for resonant 
particles is ; 
Doaz 8T ACA l 
wmews In A 
If the wave is electrostatic then show that 


E(wr/v,t) ~ klel? 


where k is the wavenumber of the waves resonantly interacting with the particles 
so that 3 r 
= 87° kid 
Dot(w) = ———s 
Qr(w) meww?, InA 
What sort of waves would be most efficient at driving current, large or small k? 
Would this current drive work best in high or low density plasmas? 


15 


Wave-wave nonlinearities 


15.1 Introduction 


Wave non-linearity is a vast subject that is not particularly specific to plasma physics be- 
cause similar non-nonlinear behavior can occur in any medium that supports wave propa- 
gation. However, because of the enormous variety of plasma waves, there is usually at least 
one plasma context where any given type of wave nonlinearity is an important issue. 

Wave nonlinearity occurs when non-linear terms become sufficiently large that they 
cannot be ignored. Three general types of non-linear wave behavior will be discussed: 
mode-mode coupling instabilities, self-modulation, and solitons. Before discussing these 
various behaviors in detail, a qualitative overview will first be presented showing how cer- 
tain wave nonlinearities typically present themselves. 

Mode-mode coupling instabilities 

Suppose a linear wave is made to propagate in a plasma by inserting in the plasma an 
antenna connected to a sine wave generator tuned to an appropriate frequency. As a spe- 
cific example, suppose the plasma frequency is wpe/2m7 =100 MHz and the generator is 
tuned to w/27 =500 MHz so as to excite an electromagnetic plasma wave with dispersion 
relation w? = Wre + k?c?. The wave propagates through the plasma and is detected by 
a distant receiving probe. Instead of connecting the receiving probe to a radio receiver or 
an oscilloscope, the receiving probe is connected to a spectrum analyzer, a device which 
provides a graphic display of signal amplitude versus frequency. The signal shows up on 
this display as a sharp peak at 500 MHz as sketched in Fig.15.1(a). If the signal gener- 
ator amplitude is increased or decreased, the spectrum analyzer peak goes up or down in 
proportion, thereby indicating the signal strength. 

An odd behavior is observed when the generator amplitude is increased above some 
critical threshold. At generator amplitudes below this threshold, the spectrum analyzer 
displays just the single peak at 500 MHz, but above this threshold, two additional peaks 
abruptly appear at two different frequencies and these new peaks have amplitudes consider- 
ably lower than the 500 MHz peak. These two additional peaks are typically at frequencies 
which sum to 500 MHz and the frequencies of the two additional peaks are typically very 
different from each other. For example, one of the two additional peaks might be at 1 MHz 
and the other at 499 MHz. These two peaks are called daughter waves and their spectrum 
is usually broader and their amplitude “springier” than the 500 MHz “pump” wave. If the 
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generator amplitude is increased further, the pump wave amplitude no longer increases in 
proportion to the generator amplitude, and instead the daughter wave amplitudes increase. 
One gets the impression that the plasma does not like to have too much power in the pump 
wave and when the above-mentioned threshold is exceeded, excess power spills into the 
daughter waves at the expense of the pump wave. If the power is increased still more, the 
daughters become quite large, and then at another threshold, the upper daughter might sud- 
denly spawn its own pair of low and high frequency daughters. The spectrum becomes 
quite complicated and the amplitudes of the various spectral components depend sensi- 
tively on the generator amplitude and on plasma parameters. One gets the impression that 
energy is sloshing around between the pump wave and the daughter waves. 


(a) (b) (c) 


amplitude amplitude amplitude 
lL f f f 
500 500 i AN: 
MHz MHz 1 499 500 
MHz MHz MHz 


Figure 15.1: Frequency spectrum showing onset of decay instability at high power when a 
500 MHz sine wave generator excites a wave in plasma. Low and intermediate generator 
power as in (a),(b) provide linear response, but high generator power as in (c) results in 
decay instability with daughter modes appearing at 1 MHz and 499 MHz. 


This behavior can occur at surprisingly modest generator power levels where it might 
have reasonably been expected that nonlinear behavior would be negligible. The springi- 
ness of the daughter wave amplitudes suggests that some kind of resonant effect makes 
the nonlinearities more important than would be expected. The behavior sketched here 
is sometimes called a parametric decay instability (Silin 1965) because the pump wave is 
considered as decaying into the daughter waves. It has analogies to photon decay and in a 
sense can be considered as the classical limit of photon decay. Whether this sort of insta- 
bility is good or bad depends on the context. If the goal is to propagate a large amplitude 
wave through a plasma and the wave decays into daughters, then the instability would be 
bad and efforts would be required to avoid it. On the other hand, if one of the daughter 
waves is normally difficult to excite and has some beneficial aspects, then the decay insta- 
bility provides a means to access the desired daughter wave. Another possibility is that the 
onset of the decay could be used as a diagnostic to provide information about the plasma. 

Self-modulation 

A closely related situation is where the low-frequency daughter wave is at zero fre- 
quency. In this case the pump wave beats with itself and so modulates the equilibrium via 
a ponderomotive non-linear pressure which acts to expel plasma. If this happens, increas- 
ing the generator amplitude above the nonlinear threshold causes the pump wave to dig a 
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hole in the plasma density. The pump wave can dig itself a channel in the plasma and then 
this channel can guide, focus, and even trap the pump wave. 

Solitons 

Plasma waves are dispersive in general so, after propagating some distance, an initially 
sharp pulse will become less sharp because the various components of the Fourier spec- 
trum constituting the pulse propagate at different phase velocities. However, it is found in 
certain situations that pulses with an amplitude exceeding some critical value will prop- 
agate indefinitely without broadening even though the medium is nominally dispersive. 
This high amplitude non-dispersive pulse is called a soliton and its existence results from 
nonlinearities competing against dispersion in a way such that the two effects cancel each 
other. 


15.2 Manley-Rowe relations 


Before investigating wave-wave nonlinearities, it is instructive to examine a closely 
related, but simpler system consisting of three non-linearly coupled harmonic oscillators 
(Manley and Rowe 1956). This simple system consists of a particle of mass m moving in 
a three dimensional space with dynamics governed by the Hamiltonian 


I et eee 
H = > (PL + PE + PS) +5 (K1QÌ + K2Q3 + 63Q3) +AQIQ2Qs. (15.1) 


In the A — 0 limit this Hamiltonian describes three independent harmonic oscillators hav- 
ing respective frequencies wy = \/K1/m, w2 = \/K2/mand w3 = ./K3/m. In the more 
general case of finite A, Hamilton’s equations for the P, , Qı conjugate coordinates are 


OH 


È = 30, = K1Q1 — AQ2Q3, (15.2a) 
. OH 
Q = T (15.2b) 


with similar equations for the P2 Q2 and Pz, Qs conjugates. Using relationships such as 
Qı = Pı /m, three coupled oscillator equations result, namely 


7 A 
Qı + wiQi = -—Q2Q3 
m 
7 A 
Qo +w3Q. = -—Q:Q3 
m 
7 A 
Q3+w3Q3 = -^QıQə (15.3) 
m 


For small 4, each oscillator may be assumed to oscillate nearly independently so that during 
each cycle of a given oscillator, the oscillator experiences only slight amplitude and phase 
changes due to the nonlinear coupling with the other two oscillators. Thus, approximate 
solutions for the oscillators can be written as 

Q(t) = A, (t) cos [wt +r On (t)] 

Qo(t) = A(t) cos [wot + ô2(t)] 

Q3(t) = Ag (t) COS [wst T 63(t)| (15.4) 
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where it is assumed that 
A; : 
rF << wj and 6; << wj. (15.5) 


j 
Each oscillator is considered as a linear mode of the system and, from now on, the term 
mode will be used interchangeably with oscillator. The energy associated with each mode 
is 1 1 1 

y. — 2 N2 MA2 M2 M 242 

W= m + z“iQj = z% + 3%Q3 = zA (15.6) 
and the associated action is 


2T 
S = $ Pag; = m f Q;aQ; = -moja f sin Yd cos Y; 
0 


2T 
ed | | m 
= musa f sin? Y; dy = w54} (15.7) 


where p; = wjt + 6;(t) is the phase of the mode. 
Since A; and 6 j are small compared to w jÅ j and wj 6 j respectively, the former terms 
may be dropped when calculating the second derivative of Q; which is therefore 
Q; = —w? A; cos(w;t + 6;) E 2w; Åj sin(wjt + 6;) _ 2; Ajd, cos(wt + 6;). (15.8) 
When the above expression is inserted into Eqs.(15.3) the terms involving ws cancel and 
what remains is 
Qu, A; sin(w jt + ô;) Xr 
. = —A;,A t+6 wit +6 
+2w;Aj6; cos(w jt + 4;) m 1 008(W i r) cost = 1) 


cos( (wk + wi)t + ôk + 61) 
+ cos((wp = w))t + Ôk = 61) 


À 
er A 


(15.9) 


It is necessary to write out the three coupled equations explicitly because the coupling 
terms on the right hand side are not fully symmetric. To identify resonant interactions it is 
assumed that 


w3 = wi +w (15.10) 
O(t) = ô +ô2-— ô3 (15.11) 
in which case the coupled mode equations can be written 


cos((w2 + w3)t + ô> + 63) 


Ae sin(w ,t + 61) ] AA A3 
. = + wa — w3) t + 62 — 6: 
| + Aıôı cos(wıt + 61) | 4mwı Cosas) Pos 03) 
resonant at —w 1 
Set 5 cos((w1 + w3)t + 61 + 63) 
Ag sin(wet + 62) 7 AA, A3 4+cos((wy 2 Ww) ies 53) 
+ Agd2 cos(weat + 62) | A4mwy - ma 
resonant at — w2 
5 a A A resonant at +w3 
Ag sin(w3t + ô: AÀ Aa 
hauti] 7 ine | toiri ti 
a ee 3 | A cos((wı — w2) t+ ô — ô2) 
(15.12) 


432 Chapter 15.. Wave-wave nonlinearities 


Using wo — w3 = —Wy, 62 — 63 = 0 — 64, W1 — W3 = —w2, 6, — 63 = 0 — bq and 
discarding non-resonant terms, these become 


i : ADA: 
Aj sin(wıt 61) Ayo cos(wıt T 61) = ree cos(6 wt 61) 
Ag sin(wet 62) Ade cos(wet oF 62) = AdiAS os(0 Wot 62) 
A4mwe 
Az sin(w3t 63) A363 cos(wst + 63) = aii os(0 H w3t + 63 ) 
Amws3 
(15.13) 
or 
Ai sin(wıt +61) | — AAgAZg | cosAcos(ayt+ 61) | 
+ A161 cos(wit + 61) | ~  dmw, | +sin ésin(wit + 61) | 
Ag sin(wat +ô) | _ AA, A3 | cos @cos(wet + 62) 
+ A262 cos(wət + 62) | ~ 4mws | +sin Psin(wet + 62) | 
Ag sin(wst +63) | _ AMA | cos @cos(w3t + 63) 
+ A363 cos(w3t + 63) | ~ 4mwg | —sinésin(w3t + 63) | 
(15.14) 
Matching the time-dependent sine and cosine terms on both sides gives 
À = AAT g (15.15a) 
4mwı 
Ay = RATES rg (15.15b) 
Amwy 
Ac AA 5G (15.15c) 
4mw3 
and 
Aiô = AAaAS cos 0 
4mwı 
Aod. = AAi cos 0 
Amwy 
A383 = ATA ai (15.16) 
4mw3 


These equations can be used to establish various conservation relations. Multiplying 
Eqs.(15.15) by wi Áj, and summing gives 


(w2A? + W2AR tu Ag) = CaTa s) aAa ing 
4m 
so 

w? A? + wA? + wA = const. (15.17) 
This shows that the sum of the energies of the three modes is constant, but allows for energy 
transfer back and forth between the three modes. 
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A corresponding set of relations for the action can be obtained by multiplying pairs 
of equations in Eq.(15.15) by wj A; and then either adding or subtracting. For example, 
multiplying the first pair by wj A; and subtracting gives 


ld 
a (wy A} — w243) = 0. (15.18) 


Appropriate adding and subtracting in this manner gives 


const. 


wi A? — we. A? 
wA? +w3A2 = const. 
Ww A? + w3.A2 const. (15.19) 


These relationships can be expressed in a manner analogous to quantum principles by defin- 
ing the effective “quantum number’ of a mode as its ratio of energy to frequency, 


wes 


m 
I Lge DJ 
Wj 2 


wjA?. (15.20) 


Itis clear that N; is the same as the action except for an unimportant constant factor. Thus, 
the action conservation rules can be recast as 


Ni- Nog = const. 
Ni+N3 = const. 
Nog+N3 = const. (15.21) 


or if changes in action are considered 


AN, = +AN»2 
AN> = —ANs. (15.22) 
This provides an action accounting scheme such that a change AN3 = —1 can be consid- 


ered as a mode 3 "photon" decaying (or equivalently disintegrating) into a mode 1 photon 
(AN, = +1) and a mode 2 photon (AN2 = +1), all the while satisfying conservation of 
energy; this is sketched in Fig. 15.2. 
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@3 photon 


decay 


@, photon @2 photon 


| 


energy conservation condition: @3; = @1 + @2 


Figure 15.2: Photon at frequency w3 decaying into a photon at frequency w, and a photon 
at frequency w2. 


An additional conservation equation can be obtained by subtracting the last of 
Eqs.(15.16) from the sum of the first two to obtain 
Ô = 6,+62-43 
AA2A3 AA Az AA, Ae 9 
H cos 
AmAw, 4AmAqwg 4mA3w3 


A Ay l Az \ cos0 
($ Ei z) sin 0 oe 
and then integrating to find 
A, Az Az cos 0 = const. (15.24) 


We now consider some solutions to the system of equations given by Eqs.(15.15b)- 
(15.15c). Suppose that Az >> Aga, Aj initially. In this case Eq.(15.15c) gives Az ~ const. 
Solving Eq.(15.15b) for Ay and substituting the result in Eq.(15.15a) gives 


1 d 1 dA? X42 
sin dt \ sinb dt 


which has exponentially growing solutions if wyw2 > 0. By defining 


= 15.2. 
16m2w we a) 


t 
r= f dt’ sin 0(t’) (15.26) 
0 
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it is seen that 


dA drd . dA» 
a dt ar a ce 
and so 2 yas 
= 3a Ay 15.2 
dr? 16m?wywe ‘ (925) 


Since wyw2 > Oand w1 + w2 = w3, if w3 is chosen to be positive, then both wı and w2 
must both be positive and therefore w3 > w 1,W 9. If A2 grows, then because of the action 
rules, A; must grow in the same proportion. Furthermore, as A; and Ag grow, A3 must 
decrease until the approximation A3 >> Aj, Az fails. This process can be considered as 
a high energy “photon” with frequency w3 decaying into an w, photon and an we photon 
where the latter two photons have lower energy than w1. Equation (15.28) shows that the 
decay of the mode 3 photon into mode 1,2 photons can be characterized by an exponential 
growth of the mode 2 amplitude, 

Ag ~ e7 (15.29) 


where 
A A3 


2 4m, /W1w2 
is the instability growth rate. Mode 3 is called the pump mode since it supplies the energy 
for the instability. Modes 1 and 2 are called the daughter modes. 


(15.30) 


15.3 Application to waves 


Nonlinearities in wave equations give rise to coupled systems of equations similar to Eqs.(15.3). 
Suppose Q now refers to a plasma parameter, say density, and that the plasma can support 
three distinct waves (modes) having respective linear density fluctuations 


Qi (x,t) = Aj (t) cos(kı T. E wit _ ôi (t)) 
Q2 (x,t) = Ag (t) cos(k2 ar Saar Wot = 62 (t)) 
Q3 (x,t) = Ag (t) cos(k3 “xX — west a 63 (t)); (15.31) 


the density Q therefore has the functional form 


Q(x,t) = Q1(x,t) + Qo(x,t) + Q3(x,t). (15.32) 


This situation is mathematically analogous to the coupled oscillator system discussed in 
Sec.15.2 if the amplitude Q; of each of the three waves is considered to be an effective 
canonical coordinate. Each wave satisfies a linear dispersion relation w; = w; (k) and non- 
linearities in the wave equations provide a mutual coupling between the modes analogous 
to the coupling between the different directions of motion for the oscillating mass discussed 
in Sec.15.2. The exp(ik - x) dependence of the waves suggests the need for a wavenumber 
selection rule 

k3 =k, + ke (15.33) 
analogous to the frequency selection rule Eq.(15.10). Using the wavenumber selection rule 
and generalizing the phase offset definition to be 6; (t) = 6,(t) — kj - x, it is seen that 
the coupled wave equations become identical to the coupled oscillator equations with ô; 


436 Chapter 15. | Wave-wave nonlinearities 


replacing 6;. Hence, so long as the selection rules w3 = wy + wy and k = k; + kg are 
satisfied, a high frequency wave w3 should decay spontaneously into two low frequency 
waves w1, w2 providing there is a suitable coupling coefficient. 


15.3.1 Examples of nonlinearities 


Nonlinearities can be divided into two general types: (1) two high frequency waves beating 
together to give a low frequency driving term and (2) a high frequency wave beating with a 
low frequency wave to give a high frequency driving term. As shown below, the first type 
of nonlinearity has the form of a radiation pressure (also known as a ponderomotive force) 
while the second type of nonlinearity has the form of a density modulation. 
1. Beat of two high-frequency waves driving a low frequency wave (ponderomotive force). 
Since ion motion is negligible in a high frequency wave, all that is required when con- 
sidering nonlinearities is the electron equation of motion, 


1 


Mene 


Oue 
ot 


For simplicity, it is assumed no equilibrium magnetic field exists, so the only magnetic 
field is the wave magnetic field. Tildes are used to denote linear quantities to avoid 
confusion with the subscripts 1,2,3 which denote the low frequency daughter, high 
frequency daughter and pump wave respectively. Because the pressure gradient term 
in the electron equation of motion provides only a small correction at high frequencies, 
the linear motion for either the pump or the high frequency daughter must thus be a 
solution of 


ayaa 1 (E +u. x B) — VP.. (15.34) 


=E, (15.35) 


The electron quiver velocity is defined to be 


qe f= 
was = i Edt’ (15.36) 


A Me 
which is the solution to Eq.(15.35) for both electron plasma waves and electromag- 
netic waves. The main non-linear terms in Eq.(15.34) are —ūe:Vūe+qe (a. x B) /Me; 
non-linearity in the pressure gradient is ignored because, by assumption, this term is 
already small. In order to obtain B, Faraday’s law is integrated with respect to time 
giving 

a t at m., 
B=-Vx | Edt =-— V xa". (15.37) 
de 
The two nonlinear terms can be combined using Eq.(15.37) to form what is called the 
ponderomotive force or radiation pressure, 


—a,- Vūe + (a xB) —a!. Val — (a2 x V x āū!) 
Me 
1 ae 
= -5v (a). (15.38) 


If only one high frequency mode exists and beats with itself, then the ponderomotive 
force —V (a) 2 /2 is at zero frequency but if the beating is between two distinct high 
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frequency modes, then the ponderomotive force contains a term at the difference (i.e., 
beat) frequency between the frequencies of the two modes. Thus, at low frequencies 
the electron equation of motion, Eq.(15.34) becomes 

üe qe = 1 ~ 1 


= ~h) 2 
ap nee aE 5 V (ue) (15.39) 


we PupN 2. ; 
where only the beat frequency component in (H) is used. The ponderomotive force 


provides a mechanism for high-frequency waves to couple to low frequency waves. 
It acts as an effective pressure scaling as men (ab) 2 /2 and so in a sense, the quiver 
velocity i acts as a thermal velocity. The ratio of ion radiation pressure to the ion 
pressure is smaller than the corresponding ratio for electrons by a factor of me/m; 
because the ion quiver velocity is smaller by this factor. Thus, ion ponderomotive 
force is ignored since it is so small. 


2. Beating of a low frequency wave with a high frequency wave to drive another high 
frequency wave (modulation). By writing Ampere’s law as 


E sute 2 JE 
V x B= Up >, (oqo üo ae N0qo Ug) F MoO -ay (15.40) 


it is seen that density fluctuations provide a nonlinear component to the current den- 
sity. The nonlinear term can be put on the right hand side to emphasize its role as a 
nonlinear driving term so that Ampere’s law becomes 


= JE as ee 
Vx B— poe0 5 Ho 2 nodoite = “Ho Dodo ie. (15.41) 


The nonlinear term is assumed to be a product of a high frequency wave and a low 
frequency wave. The linearized continuity equation gives 


OMe fe 
r Sh Way (15.42) 
showing that i, ~ U, /w so that the product n} u? is much larger than the product 


neul where land h refer to low and high frequency waves. Thus, the dominant effect 


of a low frequency wave is to modulate the density profile seen by a high frequency 
wave. 


15.3.2 Possible types of wave interaction 


As discussed in Sec.4.2, three distinct types of waves can propagate in an unmagne- 
tized uniform plasma and these waves have the dispersion relations: 


wes We + kc, electromagnetic wave 
P= We (1 + 3k? Ne) , electron plasma wave 
; ke ; 
w? = ——% ion acoustic wave. (15.43) 


2 
1+k2d32,, 
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If three waves satisfy the selection rules 


W3 = Wi +w (15.44) 
kg = k, +k, (15.45) 


they can only have the same dispersion relation if w? depends linearly on k? so that the 
magnitude of k is linearly proportional to the magnitude of w. This is not so for the modes 
listed in Eq.(15.43) except for the kabe << 1 limit of the ion acoustic wave; applying 
the selection rules to this w? = k?c? limit of ion acoustic waves gives nonlinear inter- 
actions between various sound wave harmonics, e.g., the first and second harmonic can 
interact with the third harmonic. Since this limit corresponds to the well-known nonlinear 
steepening of ordinary sound waves it will not be discussed further. 

The situations which are of particular relevance to plasmas are where modes satisfying 
different types of dispersion relations interact with each other, for example an electromag- 
netic wave interacting with an electron plasma wave and an ion acoustic wave. The various 
possibilities for non-linear interactions of the three types of plasma waves are the follow- 
ing: 


pump hf daughter lf daughter common name 

W3 W2 Wy 

em em Langmuir stimulated Raman backscatter 
em em acoustic stimulated Brillouin backscatter 
em em zero frequency _ self-focusing 

em Langmuir Langmuir two-plasmon decay 

em Langmuir acoustic parametric decay instability 
Langmuir Langmuir acoustic electron decay instability 
Langmuir Langmuir zero frequency caviton 


Here the abbreviations em, Langmuir, and acoustic stand for electromagnetic wave, 
electron plasma wave, and ion acoustic wave respectively and the ordering is by progres- 
sively lower frequency, column by column taking into account that w3 > w2 > wı and 
electromagnetic waves have higher frequencies than Langmuir waves which in turn have 
higher frequency than ion acoustic waves. 

In each case the low frequency daughter modulates the density and beats with either the 
pump or the high frequency daughter to provide a high frequency nonlinear current as given 
by Eq.(15.41) while the pump and the high frequency daughter beat together to provide a 
ponderomotive force that couples to the low frequency wave. Because c? >> v3, >> c? 
the electromagnetic wave has a stronger dependence on k? than does the electron plasma 
wave which in turn has a stronger dependence on k? than does the ion acoustic wave. 
This means that if kı, k2, and kg are all in the same direction and Eq.(15.45) is satisfied, 
Eq.(15.44) cannot be satisfied since w3(|k3|) = w3(|ki| + |k2|) > wi(|ki|) + we(|ka)). 
Thus, the frequency and wavenumber selection rules can only be satisfied if k, , k2, and kg 
are not all in the same direction. The pump wave direction is taken to define the positive 
direction, i.e., kz defines the positive direction so one of the daughter waves must propagate 
in the direction opposite to the pump. 

The electromagnetic wave in a given plasma always has a higher frequency than the 
electron plasma or ion acoustic waves. As a consequence, the low frequency daughter can- 
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not be an electromagnetic wave because then all three waves would have to be electromag- 
netic waves which would then violate the requirement that the dispersion relations cannot 
all be the same. The low frequency daughter must therefore be either an ion acoustic wave 
or an electron plasma wave. Similarly, the high frequency wave (pump or high frequency 
daughter) cannot be an ion acoustic wave and so must be either an electromagnetic wave or 
an electron plasma wave. Thus, the various interactions tabulated above can be accounted 
for by establishing the appropriate coupling equations for the following four possibilities: 
high frequency wave is an electromagnetic wave or a Langmuir wave, low frequency wave 
is a Langmuir wave or an ion acoustic wave. 


Low frequency wave is an ion acoustic wave On assuming quasi-neutrality which 
corresponds to assuming Kens, << 1, the low frequency electron and ion equations of 
motion may be approximated as 


Ou, de = kle. 1 ~hy2 

= E vV V 15.4 
Ot Me Men $ 2 (2) (13:46) 
Ou; qi = 

= = E: 15.47 
; N (15.47) 


here only the low frequency beat component of (a2)? is used. Also, as shown earlier, the 
ion ponderomotive force is negligible and therefore ignored. Because the electron mass is 
very small, the left hand side of the electron equation of motion is dropped in which case 
this equation reduces to the simple force balance relation 


È Vā- =V (ait) ~0. (15.48) 
Me Men 2, 


Using Eq.(15.48) to eliminate E from the ion equation gives 


Ou; Te s 1 e 
ui K Vii m 


ot min 2 mi 


V (ay. (15.49) 


e 


Because quasi-neutrality is assumed ñ; = me = n, and so the time derivative of the ion 
continuity equation can be written as 


rn Ou; 
DE +nv - le 0. (15.50) 
Substituting for Ou; /ðt, the above equation becomes 
Orn KT. ~. Ime ~h\2 
ot? ae ( ak | a O ) ra ay 


which can be written as an ion acoustic wave equation with a non-linear coupling term due 
to the electron ponderomotive force, 


(15.52) 


If the electron quiver velocity is considered to behave as an effective thermal velocity, then 
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the low frequency beat component of (a) * can be considered as modulating the effective 
electron temperature. Specifically, 


3 ae me (tit)” 
NeKL. +nKT. = NekT. + a a (15.53) 
and so F 
Te e (ù; 
ace ac +n(2) = ie tni iene (te) (15.54) 
Mi 2m; 


which is consistent with Eq.(15.52). 


Low frequency wave is an electron plasma wave In this case the low frequency wave 
frequency is above Wye so that the ions form a stationary background. Now only the 
electron dynamic response matters and the density perturbation is not quasi-neutral. The 
electron equation of motion for the low frequency wave is 
ou - 3KT. n 
e Z de E “Vite 
n 


Ve aed 
Ai me T V (ùk) (15.55) 


2 e 
where the 3 comes from the adiabatic pressure perturbation. Combining this with the time 
derivative of the electron continuity equation, 


Dfe Oue 
a2 + nV. ( Dt ) =0, (15.56) 
gives 
Orne Ge = 3kTe 1 ~h\2 
PYE PAV n mena zV (ite) ) =0. (15.57) 


This may be simplified by invoking Poisson’s equation 


= N 
V- E =— ñeqe (15.58) 
E0 
to obtain 
o ie Siw 3KTe 2- N 2 ~p\2 
DE + Wpeħe — M he= 5 V (ai?) : (15.59) 


The left hand side is the electron plasma wave equation (Langmuir wave) and the right 
hand side provides the nonlinear drive (or coupling) due to ponderomotive force. Again, 
using Eq.(15.53) it is seen that the ponderomotive force term acts like a modulation of the 
electron temperature such that nk Te — AKT. +nkT. where T, is due to the high frequency 
electron quiver velocity. 


High frequency wave is an electron plasma wave In this case the wave is electro- 
static and so there is no high frequency oscillating magnetic field. The non-linear continuity 
equation is 


OMe Z ra 
= +n V- te =—V: (ñeūe) (15.60) 
and taking a time derivative this becomes 
ite üe ð _ 
ar a pee aT (15.61) 


ot? Ot Ot 
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However, the high frequency linear electron equation of motion is 


Oue de = 3KTe 


= E Vite 15.62 
ot Me Men 4 (19.62) 
so Eq.(15.61) becomes 
O e 2 3KTe 9. 9 on 
yD) + gets — =a Vite = tae (fete). (15.63) 


In principle, there are two possible components in the right hand side nonlinear term since 
rete = ALU? + APAL where land h refer to low and high frequency waves. If both high 
and low frequency waves are electrostatic, then u. ~ E= ~ikd and V-u,. = ik- ūe = ik- 
a. ~ k?ġ where 4 is the electrostatic potential. The linearized continuity equation gives 


fe k-ū k6 


= ~ (15.64) 
n w w 
and so se 
kipi, 5 
Bags EEE 
Loh ae RP fh 
ES ee Le (15.65) 
Aral ee wy / ky 
Oo, Kigi 


since the phase velocity of the high fiequenty mode is much higher than the phase velocity 
of the low frequency mode. Thus, the ñLū? term dominates and the other term can be 
discarded. If the high frequency wave associated with a” is an electromagnetic wave, then 
the electric field is transverse so V - a ~ V - E? = 0 in which case there is no high 
frequency density fluctuation, i.e., ñ = 0. Again, only the ALĀ? term is to be retained. 
Hence, the high frequency wave equation becomes 

OW fie 2 - 3KTe 


—+ Wyte — 
Ot? pe’ Me 
h 


If the high frequency wave is electromagnetic, then V- (ña ) 


2 ge 
Vite = -V (fie). (15.66) 


h) = a”. Vil showing that the 
non-linearity consists of a density modulation due to the high frequency motion across the 
density ripples of the low frequency mode. For purposes of consistency, it is worthwhile to 
express Eq.(15.66) in terms of electric field using 


~ 1 
V- E = — ñeqe (15.67) 
£0 
so 
- 3KTe 25 ð 1 
we Kes) _ FP to ysl, oh 
V. ee ee Se Ha 7A VE San (7iqet?) (15.68) 


which can be integrated in space to give the general expression for a high frequency electron 
plasma wave with nonlinear coupling term, 


PE Sa Sk aoa 
Sat we -ey = 


= T ( (15.69) 


LÊ (ib gett) 
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High frequency wave is an electromagnetic wave Here the wave is transverse so 


V- E =0 and the curl of Faraday’s law becomes 


ƏV xB 
=a 


Substituting Eq.(15.41) and using the linear equation of motion gives the general expression 
for a high frequency electromagnetic wave with non-linear coupling term, 


VE= (15.70) 


PE 25 292 EO E 
opt wpeE—c V E = = (7, qe?) ; (15.71) 
the term ñ? qe} has been dropped for the reasons given in the previous paragraph. 


Summary of mode interactions Examination of the various combinations consid- 
ered above shows that the non-linear coupling acting on high frequency modes (either pump 
and high frequency daughter) is an effective modulation of the density experienced by the 
high frequency wave, i.e., We => Wee + we i! /n. On the other hand the non-linear cou- 
pling acting on the low frequency daughter comes from the electron ponderomotive force 
which effectively modulates the electron temperature experienced by the low frequency 
mode, i.e., RKTT. — AKT. + NK (a2)? RA 


Coupled oscillator formulation To see how the coupled wave equations can be ex- 
pressed in terms of coupled oscillators, the specific situation of an electromagnetic wave 
interacting with a Langmuir wave and an ion acoustic wave is now considered. In this case, 
the three coupled equations are 


22Ez 


2g e er gs 2 
Pe + wpe E3 — cv E3 = ma at (fi U2) 
o Ez 25 3kTe oza de O 
ot? gee = Me YES eo ôt (Mts) 
OF, E 
F — EV = nV? (fi - Gs) (15.72) 
where the relation 
1 py? lites E = Z = 
5 (ue) = 5 (fe + tis) - (fe + fis) = (ñz - tts) (15.73) 
w1,kı 


has been used and the angle brackets refer to the component oscillating at the beat fre- 
quency w , = w3 — we and having the beat wavevector kı = kg — ko. 

The subscript e has been dropped from all dependent variables because they all refer 
to electrons. Since the acoustic wave frequency is much smaller than wpe, it is possible to 
approximate 


ð ou e = 

5 Pa) a = fa Be, (15.74) 
Using the quiver relation Eq.(15.36), the product of the high frequency velocities can be 
expressed as 


2 - - 
(ñz - iy) = — = — (B Ès). (15.75) 
MeW2W3 
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Although the two velocities are oscillating 90° out of phase relative to their respective elec- 
tric fields, the non-linear product of the velocities has the same phase behavior as the elec- 
tric fields. This is because the nonlinear product of the velocities scales as (cos wat cos w3t) 
and the nonlinear product of the electric fields scales as (sin wat sin wt) so that the dif- 
ference between these scalings is (cos wet cosw3t — sin wat sin wst) = (cos(w2 + ws)t) 
which vanishes because w2 + ws is non-resonant. The system of equations thus becomes 


o? 2 202 \F 2 g 
(i Wie eV" | Es = =. F E2 
Ə 3KT. 2 My = 
(2 i we = ve) E> = =w = Es 
ə? 202 ny a 2 F 
ave = Sey (E2 - Es). (15.76) 


If different modes are used, e.g., a Langmuir mode decaying into another Langmuir mode 
or if the low frequency mode is a Langmuir wave, the left hand side wave terms will be 
changed accordingly, but the right hand coupling terms will stay the same except if the low 
frequency wave is a Langmuir wave in which case the product m,;m, must be replaced by 
m2 in the denominator of the right hand side of the third equation. 

The right hand coupling terms can be made identical by defining a renormalized density 
perturbation 


pra (15.77) 
n 
so that the equations become 
o? 2 202\F 2 ý = 
(Gat We—€ V°)E3; = —Whe ra E» 
Oo og E oe 
a2 7 2D ado a 
2 ev)ý = -a _ h, Ë. (15.78) 
Ot? MjMeW2W3 
Equating the coefficients of the second and third terms gives 
ie yg ES (15.79) 
5 qekı ` 
so the equations become 
o? 2 202\F E 
apt Wet V°)E3 = —Aw Eo 
2? ; 3KTe = 
(2 Hwe — Fa v2) Es = —\WEs 


a? ‘ ee 
($-a) = —dE,- Es. (15.80) 
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where ; 
WedeN1 
A= SS. 15.81 
VIN{MeW2W3 
Defining the mode frequencies as 
we = we + kže 
Ds = Wee + 3k2KT. /me 
we ee (15.82) 
the coupled equations become 
2? - ~= 
(Fz $ 4) E; = —Aw Es 
2? - ~= 
(sat 4) Eo = -\WEs3 
2? ~ - = 
(Fs tut) y = —AEo- És (15.83) 


which is identical to the coupled oscillator system described by Eq.(15.3) if m is set to 
unity in Eq.(15.3) and Eg is parallel to E3. Using Eq.(15.30), the nonlinear growth rate is 
found to be 

AE3 
4 ./wywe 
lw»; /w3, dels 


= 2 ky : (15.84) 
4 we Wy MeW3 


15.4 Non-linear dispersion formulation and instability 
threshold 


An equivalent way of considering the effect of nonlinearity is to derive a so-called nonlin- 
ear dispersion relation (Nishikawa 1968b, Nishikawa 1968a). This method has the virtue 
that both wave damping and frequency mismatches can easily be incorporated. To see how 
damping can be introduced, consider an electrical circuit consisting of an inductor, capaci- 
tor and resistor all in series. The circuit equation is 
Q p Q 
dt2 ° dt C 
where Q is the charge stored in the capacitor and the current is J = dQ/dt. The general 
solution is Q ~ e”* where w satisfies 


L 


=0 (15.85) 


R 1 
2 . 
: ioe eh 15. 
w + iw TIC (15.86) 
Solving for w gives the usual damped harmonic oscillator solution 
iR 1 R? 
eae (15.87) 


2L LC AL?" 
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Thus, if 1/LC >> R/L the imaginary part of the frequency is w; = —R/2L and the 
real part of the frequency is w, = +1/V LC. This means that the resistor can be identified 
with damping by the relation R — —2w; L. Interpreting Q; now as the dependent variable 
for oscillation mode j, the appropriate differential equation for the damped mode can be 


written as 29 Q 
dQ; dQ; 2 
TI T; a wQ; =0 (15.88) 
where T; = |Imw,| is the linear damping rate for mode j. Mode j therefore has the time 
dependence exp(+iw;t — T;t) where both w; and T; are positive quantities. 
Thus, when dissipation is included, the typical system given by Eq.(15.83) generalizes 
to 


o? ð 2\ = ~ = 
($ Tsz 4) Es = —d\~ Ep 
o? ð - ~ = 
($ a 3) E> = —\WE3 
a? ð yo ae Ls 
(Feng, + 2) wo = —)\Eo- Es. (15.89) 


Let us now consider again the situation where the pump wave Es is very large and so 
may be considered as having approximately constant amplitude. The decay waves then do 
not grow to sufficient amplitude to deplete the pump energy and so only the last two of the 
three coupled equations have to be considered. We define S2 as a unit vector in the direction 
of E» so that 


Ey = 8+ Ep (15.90) 
and so the last two of the three coupled equations become 
a a one ae Se 
(Fs tel at 4) Ey = —Aso- Ez 
o? ð aN = Foo- E 
PA + Tia +wily = —AE282- Es. (15.91) 


The wavevector selection rules are assumed to be satisfied but a slight mismatching in the 
frequency selection rules will be allowed. The pump wave is now written as 


8 - Es = Z3cosw3t (15.92) 


where Z3 is the effective pump wave amplitude. The coupled daughter equations then 
become 


ə 0 ~ AZ: iw iw oi 
Ca n 8) Be Gee ene) e 
Ə? o ~ AZ3 , 7 b 
(Faery, + 4) y= ae (els! + pte) Ez. (15.93) 


The Fourier transform of a quantity f (t) is defined as 


f(w) = / dt f(t)? (15.94) 


446 Chapter 15.. Wave-wave nonlinearities 


where the sign of the exponent is chosen to be consistent with the convention that a single 
Fourier mode has the form exp(—iwt). The Fourier transform of f(t)e='”2° will therefore 
be 


J dt (f (tjet) el” = f(w ws) (15.95) 
and the Fourier transform of Of /Ot will be 


fa (310) e*t — —iw faros = —iwf (w). (15.96) 


Application of Eqs.(15.95) and (15.96) to Eqs.(15.93) gives 


~ AZ3 f~ ~ 
(—w*—2iwl'2 + w3) Elw) = > [a ( Hwg) + y (w wa)| 
2 : Wo AAS f= I 
(=w —2iwl + w?) p(w) = -5z B (w + w3) + E2 (w — ws)| .(15.97) 
By defining a generic linear dispersion relation 
€;(w) = —w?—2iw D5 + wi, (15.98) 
these equations can be written as 
~ AZs = ~ 
eaw) Ëu) = -2 |% (w +w) + ý% (w wa)| (15.99) 
+ AZ3 [= ~ 
Er(w)v(w) = rma B (w + w3) + Bo (w — ws)| i (15.100) 


The frequency w in Eq.(15.100) can be replaced by w + wg so that 


AZ3 |Z 


p(w + w3) = Ez (w) + Ey (w £ 2ws)| 


~ 261 (w + ws) 


which is then substituted into Eq.(15.99) to obtain 


: AZ \ 
stie = (2) 
(15.101) 


The terms Ê> (w + 2w3) can be discarded since they are non-resonant and therefore of 
insignificant amplitude. What remains is 


O ABY 1 1 
ez(w) = ( ) eee <a | (15.102) 


which is called the non-linear dispersion relation (Nishikawa 1968b, Nishikawa 1968a). 
The non-linear dispersion relation is investigated by first writing 


É» (w) + Ei (w — 2w3) 
Elw — w3) 


By (w) + E2 (w + 2w3) 
Elw + w3) 


w =x + iy (15.103) 


where it is assumed that 
£ ~ we (15.104) 
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and also 
W-W3=X+tiy—w3Y —w]. (15.105) 
These assumptions and definitions have been made so that ¢1(w — w3) is close to zero, 
€1(w-+ws3) is not close to zero, and positive y corresponds to instability. The term 1/21 (w+ 
w3) can therefore be discarded as being non-resonant and the nonlinear dispersion relation 
simplifies to 
AZ \? 


€9(w)e1(w — w3) = z] (15.106) 


The linear dispersion relations £2 and €, on the left hand side are each Taylor-expanded to 
give 


ea(w) = ez(wz +x -— wə + iy) 
n de 
~ €9(we) + (x — we + iy) T bets 
= —2iw 2 — 2wə (x — we + iy) (15.107) 
and 
eilw — w3) = ey(—wytutiy+wy,—ws) 
: de, 
x ea(—w1) +(x +iy +w — w3) ne 
= QiwyD, + 2w; (x + iy +w — w3). (15.108) 
Using these expansions, the nonlinear dispersion relation becomes 
1 /AZ3\? 
i (T2 4 wo)} fi (Pity) + (£ +w- w3)} = — 
Di (Ce +9) = (c=) {i ity) + ete en) = Se (A 
(15.109) 
or, in more compact form, 
1 ee 
(@+i (T2 +y)) (z — A +i (Tı +y)) = ae ( 2) (15.110) 
where 
T=T— We (15.111) 
and 
A = w3 — (w1 + wa) (15.112) 
is the frequency mismatch. 
The real and imaginary parts of Eq.(15.110) are 
2 
7 _ 1 AZ3 
—A)z-(T Tı + = — — 15.113 
@-A)e-(Ma+y Cty) = -i (AB) asa 
z(Ty+y)+(e@ —A)(T2 +y) = 0. (15.114) 
Solving Eq.(15.114) gives 
r T: 
pe AEU Ay poA A (15.115) 


(2y +r + T2) (2y +Tı + T2) 
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and substitution for z and  — A in Eq.(15.113) results in the expression for the growth 


rate 
A? LRN 
Tə +y) (Ti 4 Ls = aN 15.116 
(T2 +y) (Tı »( a san ( l ) ( ) 


Onset of instability onset corresponds to y being close to zero and at marginal stability 
Eq.(15.116) becomes 


XZ 


(T2 +rı)y +r T2 = me (15.117) 
16004009: | 1-5 
(T2+T,) 
The growth rate near threshold therefore is 
= : aes TT (15.118) 
y= (T2 4 Tı) A2 1142 . . 
l6wiwo | 1 + ———__, 
(T2 +Tı) 
This shows that the pump amplitude threshold for instability is 
ATS A? 
Ye hac ac E E = (15.119) 
A (T2 +T: ) 


this threshold is proportional to the geometric mean of the linear damping rate of the two 
modes. If the pump amplitude is below threshold then the nonlinear instability does not 
occur. The lowest threshold occurs when A = 0, i.e., when the frequency selection rule 
is exactly satisfied. Instability when the pump amplitude exceeds a threshold is routinely 
observed in actual experimental situations as was discussed earlier, i.e., decay instability of 
a pump wave is observed to begin only when the pump wave amplitude exceeds threshold 
(for example, see Stenzel and Wong (1972)). For a pump amplitude well above threshold, 


Eq.(15.116) becomes 
2 
1 AZ A? 
y = 3 (15.120) 
Wy W2 4 4 


which shows that frequency mismatch reduces the growth rate. The situation reduces to 
Eqs.(15.84) and (15.30) when A = 0. Thus the non-linear dispersion relation formalism 
extends the Manley-Rowe coupled oscillator model to include the effects of both dissipa- 
tion and frequency mismatch. 


15.5 Digging a hole in the plasma via ponderomotive force 


If the low frequency daughter mode is at zero frequency, then the ion acoustic mode ceases 
to be a wave. Instead, it becomes a density depletion caused by the ponderomotive force 
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and so Eq.(15.49) reduces to 


1 1 Me 
Vn = 
n es 2 KT, 


y (at)? (15.121) 


e 


which can be integrated to give a Boltzmann-like relation 


fi 1 Me O 

n 2 KT. J 
Since the high frequency daughter wave is the same as the pump wave there is now only one 
high frequency wave and so there is no need to have subscripts distinguishing the modes. 
The consequence is that the high frequency wave propagates in a plasma having a density 
depletion dug out by the ponderomotive force. For example the Langmuir wave equation 
in this case would become 


(15.122) 


o? ð 3 
2T Ta + Whe 1 


1 me (PYN aKT 


2 -h __ 
eT = y? | a’ =0 (15.123) 


e 


where w” has been used as the linear variable instead of E and a linear damping term 
involving I’, the linear damping rate has been introduced. 

An undamped linear Langmuir wave in a uniform plasma satisfies the dispersion rela- 
tion w? = well + 3k? A2) where kApe << 1 so that the wave frequency is very close 
tO wpe. It is reasonable to presume that the nonlinear wave also oscillates at a frequency 
very close to wpe and so what is important is the deviation of the frequency from wpe. To 
investigate this, the electron fluid velocity is assumed to be of the form 


a(x) = Re[A(z,t)e 4] 
= > {A(x, tere? + A* (x, t)e ret} (15.124) 
in which case i 
-a2 \ 42 
((at)") = 31A] (15.125) 


and the time dependence of A(x,t) characterizes the extent to which the wave frequency 
deviates from wpe. Because this deviation is small, A changes slowly compared to wpe, 
and so in analogy to Eq.(3.22) it is possible to approximate 

G4 E ‘ —iw : OA —iw 

ag [A(a, the et] ~ —we A(x, t)e ret — 2iwpe Fe pet (15.126) 
and 

O 
T [A (x, t)e rt] x —i2wpe r Ae Pe’. 

Substitution of Eq. (15.126) into Eq.(15.123) gives 


we Me |A|? 


e 


JA 
2iwpe Fe + i2W pel A4 A+7?V7A = 0. (15.127) 
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By defining the normalized variables 


T = Whet/2 
_ A 
0 DVT] 
E = Xwy-/C 
n = IW/wye (15.128) 


Eq.(15.127) can be put in the standardized form 


8) 

ix tinx + [xl xtV2x = 0; (15.129) 
T 

this is called a non-linear Schrödinger equation since if 7 = 0, this equation resembles a 

Schrödinger equation where Ixl? plays the role of a potential energy. 


In order to exploit this analogy, we recall the relationship between the Schrödinger 
equation and the classical conservation of energy relation for a particle in a potential well 
V(x). According to classical mechanics, the sum of the kinetic and potential energies gives 
the total energy, i.e., 


2 
— +V(x) = E. (15.130) 
m 
However, in quantum mechanics, the momentum and the energy are expressed as spatial 
and temporal operators, p = —ifV and E = if0/0t which act on a wave function % so 
that Eq.(15.130) becomes 
h? Ow 
—— Vy + Vy = ih 15.131 
ime or R (15.131) 
or, after re-arrangement, 
200 a Pie, 
in— —Vy+=——VVv=0. (15.132) 
Ot 2m 


Equation (15.130) shows that a particle will be trapped in a potential well if V (+00) > 
E > Vanin Where Vmin is the minimum value of V. From the quantum mechanical point 
of view, |2|? is the probability of finding the particle at position x. Thus, existence of 
solutions to Eq.(15.132) localized to the vicinity of Vinin is the quantum mechanical way of 
stating that a particle can be trapped in a potential well. Comparison of Eqs.(15.129) and 
(15.132) shows that — af plays the role of V and so a local maximum of af should act 
as an effective potential well. This makes physical sense because Langmuir waves reflect 
from regions of high density and the amplitude-dependent ponderomotive force digs a hole 
in the plasma. Thus, regions of high wave amplitude create a density depression and the 
Langmuir wave reflects from the high density regions surrounding this density depression. 
The Langmuir wave then becomes trapped in a depression of its own making. Formation of 
this depression can be an unstable process because if a wave is initially trapped in a shallow 
well, its energy |x|? will concentrate at the bottom of this well, but this concentration of 
|x|? will make the well deeper and so concentrate the wave energy into a smaller region, 
making ||? even larger, and so on. 
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Caviton instability 
The instability outlined above can be described in a quantitative manner using the 1-D 
version of Eq.(15.129), namely 


s Ox r 2 3?x 
H H b=, —0. 15.133 
! OT 1X Ixl X ae? ( ) 
It is assumed that a stable solution x(x, t) exists initially and satisfies 
OX : 2 Ax, 
Gp tiot Xol xot Gee = 0 (15.134) 


where |Xo(x,t)| is bounded in both time and space. Next, a slightly different solution is 
considered, 

x(x, t) = x(x, t) + X(x, t) (15.135) 
where the perturbation (x,t) is assumed to be very small compared to x(x, t). The equa- 
tion for x(x, t) is thus 


A si A W PE ao 5 
iz (xo +X) + iN (Xo +X) + [Xo +X (Xo X) a 3 (xo +X) =9. (15.136) 
Subtracting Eq.(15.134) from (15.136) yields 
j ð YY Liny 1 7 2 Av 2 o? Vv 
ino X + ink 4 Ixo + XI" (Xo +X) — [Xol Xo+ T (15.137) 


Expansion of the potential-energy-like terms while keeping only terms linear in the 
perturbation gives 


xo + XI? (Xo +X) — IXol Xo = XOX" +2 [xol X (15.138) 

so Eq.(15.137) becomes 
nOn ia Q~ x 2. © 
iA + ink + xox" 4 2 [xol Xt 5 


It is now assumed that the perturbation is unstable and has the space-time dependence 


Y=0. (15.139) 


a (15.140) 
in which case Eq.(15.139) becomes 
(iy + in +2 [x0l? =k?) & = — XBX" (15.141) 
which has the complex conjugate 
(—iv- in +2 bal? =) X =- xx (15.142) 


Combining the above two equations gives a dispersion relation for the growth rate 


(y+)? = —k* +4k? Ixo — 3lxol*. (15.143) 
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The maximum y is found by taking the derivative of both sides with respect to k? and 
setting this derivative to zero, obtaining 


Reis = [alr (15.144) 


as the value of k? giving the maximum for y. Substitution of k?,,.. into Eq.(15.143) gives 


max 


Ymax = —N + Xol" - (15.145) 


Thus, the configuration is unstable when | Xal > 7 or in terms of the original variables 


when 
n 2 
u T 
= > à (15.146) 
\/ KT. / Me We 
This is called a caviton instability and it tends to dig a sharp hole. This is because each 
successive stage of growth can be considered a quasi-equilibrium which is destabilized 
and from Eq.(15.144) it is seen that the most unstable k becomes progressively larger as 
the amplitude increases. Another way of seeing this hole-digging tendency is to note that 
Eqs.(15.141) and (15.142) are coupled by their respective right hand terms and these terms 
are proportional to the amplitude of the original wave. It is this coupling which leads to 
instability of the perturbation and so the perturbation is most unstable where the amplitude 
of the original wave was largest. The digging of a density cavity in a plasma by a Langmuir 
wave was observed experimentally by Kim, Stenzel and Wong (1974); the density cavity 
was called a caviton. 
Stationary Envelope Soliton 
A special, fully nonlinear solution of Eq.(15.133) can be found in the limit where damp- 
ing is sufficiently small to be neglected so that the non-linear Schrodinger equation reduces 
to 


Ox Px _ 
ar ae? 
We now search for a solution that vanishes at infinity, propagates at some fixed velocity, 
and oscillates so that 


Ixl? x (15.147) 


x = gE. (15.148) 
With this assumption, Eq.(15.147) becomes 


-Qg 4.9? +g" =0. (15.149) 
After multiplying by the integrating factor g’, this becomes 
d Q a ep RE 
H H = 0. 15.150 
dé ( 3I tj 5) ( ) 


Since the solution is assumed to vanish at infinity, integration with respect to € from = 
—oo gives 


1 
(0? =99 — 59° (15.151) 
or d 
g = dé. (15.152) 
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By letting 
V2 
g= (15.153) 
= cosh 0 
it is seen that 
V2 
dg = — z- sinh 0d0 (15.154) 
i . cosh” 0 
in which case 
0=—-JVNE+6 (15.155) 


where 6 is an arbitrary constant. Thus, the solution is 
E /IQ 27 
cosh ( JOE = 6) 


which is localized in space. For large Q the oscillation frequency and amplitude both 
increase, and the localization is more pronounced. 

Propagating envelope soliton 

One may ask whether the above solution can be generalized to a propagating solution, 
i.e., can € be replaced by € — vt where v is a velocity? Making only this replacement is 
clearly inadequate and so a solution of the form 


X = g(E — vt) TTMS) (15.157) 


is assumed where h(£,7) is an unknown function to be determined. Substitution of this 
assumed solution into Eq.(15.147) gives 


x(E,7 (15.156) 


ðh ðh an\” 
ivg’ —Q tg +g" +2i—g' =0. 15.158 
MEEA E el ag (se) g ( ) 
Setting the imaginary part to zero gives 
Oh 
= 15.1 
v DE (15.159) 
which can be integrated to give 
h= 5 + f(r) (15.160) 
where f(T) is to be determined. The real part of the equation becomes 
Oh 2 
Qg — g +P +g" -Zg =0. (15.161) 
OT 4 
If we set 5 , 
h v 
— 15.162 
OT 4 ( ) 
then the terms linear in g vanish, f(T) = —v?7/4 and so 
vE v? 


Thus, Eq.(15.161) reverts to Eq.(15.149) which is solved as in Eqs.(15.150)-(15.156) to 
give the propagating envelope soliton 


V2Q exp (iNT + ivE/2 — iv?r/4) 
cosh ( VAE — vt) — ô) l 


x(E,7) = (15.164) 
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15.6 Ion acoustic wave soliton 
The ion acoustic wave dispersion relation is 


22 
2 k*cs 


eS (15.165) 
1+ kA 
which has the forward propagating solution 
w = ke, (1+ KAZ) 17 (15.166) 
or, for small kAp, 
w= ke, =k A}, /2 (15.167) 


where the last term is small. Since 0/0x — ik and 0/Ot — —iw the reverse substitution 
k — —i0/0z and w — i0/0t can be invoked so that the forward propagating ion acoustic 
wave can be written as a partial differential equation for say, the ion velocity 


Bu; ASe 

7 ne a -\3 1 ADs 
ioe ae ae 

After multiplying by —i, this gives the dispersive forward propagating wave equation 


(15.168) 


(15.169) 


This wave equation was derived using a linearized version of the ion fluid equation of 


motion, namely 


Ou; 
mis = GE. (15.170) 
If the complete nonlinear ion equation had been used instead, the ion fluid equation of 


motion would contain a convective nonlinear term and be 


Ou; Ou; 
i ia | = ae. 15.171 
m (GE tute) =a (15.171) 
This suggests that inclusion of convective ion nonlinearity corresponds to making the gen- 


eralization A 5 5 
ui ui OU; 
Uj 15.172 
E OE e E tee ee 
and so the forward propagating ion acoustic wave equation with inclusion of ion convective 
nonlinearity is 


Ou; Ou; BGs Ou; 
H (Ui + Cs =0. 15.17 
: i (u cs) Ox 2 dx? teat) 
This suggests defining a new variable 
U = ui + Gs (15.174) 


which differs only by a constant from the ion fluid velocity. The forward propagating 
nonlinear ion acoustic wave equation can thus be re-written as 


ðU ðU Ade, BU 


ae a 2 dx 


= 0. (15.175) 
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We now introduce dimensionless variables by normalizing lengths to the Debye length, 
velocities to c, and time to wp; = Cs /Ade. In this case 


x=-, f= t= Wit (15.176) 
so the wave equation becomes 


OX, 18x 


this is called the (KdV) equation (Korteweg 1895) and modern interest in this equation 
was stimulated with the discovery of a general solution to the soliton problem by Gardner, 
Greene, Kruskal and Miura (1967). 

Because the linear wave is forward traveling with unity velocity in these dimensionless 
variables, it is reasonable to postulate that the nonlinear solution has the forward propagat- 
ing form 


0; (15.177) 


xX =x(€-— Vr) (15.178) 
where V is of order unity. A special solution can be found by introducing the wave-frame 
position variable 


=f Vr (15.179) 
so the lab frame space and time derivatives can be written as 
ð ð 
ðE OM 
fe) ð 
=o a 15.1 
ar on (15.180) 


Substitution of these into the wave equation gives an ordinary differential equation in the 
wave-frame, 

dx 1 dx? 1dďxy 

dn 2 dn ` 2dr 
where d has been used instead of ô because the equation is an ordinary differential equation. 
A solution is now sought that vanishes at both plus and minus infinity; such a solution is 
called a solitary wave. To find this solution, Eq.(15.181) is integrated using the boundary 
condition that x vanishes at infinity to obtain 


=0 (15.181) 


Ke lgx 
2 "2dan? 


0. (15.182) 


Multiplying by the integration factor dy/d7 allows this to be recast as 


d vV x3 ~~ 1 (dy \? 
=0 15.1 
dn ( 2 6 4 \ dn a) 


and then integrating gives 


a 2 
Doa (v 2 5x) (15.184) 
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where the boundary condition at infinity has been used again. This can be written as 


d 3 
eA) er (15.185) 
91/2 Vr x 
3 3 
The substitution ‘a 
x 
2 = — 15.186 
cosh? 0 ( ) 


allows simplification of Eq.(15.185) to 


X 2V sinh 0 
d ( ) = dd. 15.187 
3 cosh? 0 ( ) 
Thus, Eq.(15.185) becomes 
B 2V = 0 ao 
cosh" @ = dn (15.188) 
F V V 
21/2 
cosh? 0 cosh? 0 
or 
/2 
—\/—d#@=d 
V n 
which can be integrated to give 
V 
d= 5 (€& — 1) (15.189) 


where €, is a constant. The propagating solitary wave solution to Eq. (15.177) is therefore 


3V 


X(E,7) = 2E ; (15.191) 


cosh? (V5 (fo €— vn) 


This solution, called a soliton, has the following properties: 


x(n) = (15.190) 


or 


1. It vanishes when |£| — oo as required. 


2. The spatial profile consists of a solitary pulse centered around the position € = €g+V7 
3. The pulse width scales as V~‘/? and the pulse height scales as V so that larger am- 
plitudes are sharper and propagate faster. 

An important property of solitons is that they obey a form of superposition principle 
even though they are essentially nonlinear. In particular, a fast soliton can overtake a slow 
soliton such that after the collision or interaction, both the fast and slow solitons retain 
their identity. The underlying mathematical theory explaining this surprising behavior is 
so complex as to be beyond the scope of this text (the interested reader should consult 
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Drazin and Johns (1989)). This theory is called inverse scattering (Gardner et al. 1967) 
and involves mapping the non-linear equation to a special linear equation which is then 
solved and then mapped back again to give the time evolution of the nonlinear solution. 
The special linear equation describes a quantum mechanical particle trapped in a potential 
well where x(€,t) plays the role of the potential. In particular, the linear equation is of 
the form Pye? + (A—x(§,t)) y = O where A is an eigenvalue and time t is treated 
as a parameter. When this linear equation is solved for (£, t) and the solution is inserted 
in the KdV equation it is found that 0A\/Ot = 0. This invariance of the eigenvalues is a 
key feature which makes possible the {nonlinear —linear} and then {linear—nonlinear} 
mappings required to construct the solution. 


15.7 Assignments 


1. Pump depletion for a system of three coupled oscillators. 


(a) 


(b 


wm 


(c) 


(d 


(g 
(h 


) 


) 


wm 


Suppose A; = Oat t = 0 but Ag and Ag are finite. What is the value of the 
constant in Eq.(15.24)? 


Suppose that sin 0 is not zero. Will A, become finite at times t > 0? If A, 
becomes finite, what constraint does Eq.(15.24) put on the value of cos@ and 
hence on sin 0? 


Use Eqs.(15.21) to write A?(¢) and A3(t) in terms of A(t) and the initial con- 
ditions A2(0), A?(0), and A2(0). 

Square both sides of Eq.(15.15c) and use the results in (b) (c) above to obtain an 
equation of the form 


1 (dA3\" 
= U(A3) =E 15.192 
5 ( T ) +U(A3) ( ) 
where Æ is a constant. Sketch the dependence of U(A3) on As indicating the 
locations of maxima and minima. 


Consider Eq.(15.192) as the energy equation for a pseudo-particle with veloc- 
ity dA3/dt in a potential well U (A3). What is the total energy of this pseudo- 
particle in terms of A2(0), A7(0), and A?(0)? What does the position of the 
pseudo-particle in the potential well correspond to? 


Considering the initial conditions given in (a), where is the pseudo-particle ini- 
tially located in its potential well? Sketch the qualitative time dependence of A3 
and indicate how features of this time dependence correspond to the location of 
the pseudo-particle in its potential well. 


Give an integral expression for the time required for Ag to go to its first zero. 


Solve Eqs.(15.15a)-(15.15c) numerically for the initial conditions given in (a) 
and use the numerical results to show that the time dependence of A3 can be 
interpreted as the position of a pseudo-particle in a potential well prescribed by 
Eq.(15.192). Note: The reduction of the pump wave amplitude as it transfers en- 
ergy to the daughter waves is called pump depletion; the mathematical behavior 
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discussed in this problem can be expressed in terms of Jacobi elliptic functions 
(Sagdeev and Galeev 1969). 


2. Stimulated Raman scattering in laser fusion: Laser fusion is a proposed method for 
attaining a controlled fusion reaction. This method involves illuminating a millimeter 
radius pellet with intense laser light so as to ablate the outer layer of the pellet. The 
radial outflow of the ablating material constitutes a radial outflow of momentum and 
so, in order to conserve radial momentum, the rest of the pellet accelerates inwards 
in rocket-like fashion. An important issue is the decay of the incident laser light 
into other waves because such a decay would reduce the power available to drive 
the ablation process. For example there could be stimulated Raman scattering where 
the incident laser light decays into a outward propagating electromagnetic wave (the 
backscattered light) and an inward propagating Langmuir wave. 


(a) 


(b) 


(c) 


Let the incident laser beam be denoted by subscript 3, the outward propagat- 
ing electromagnetic wave be denoted by subscript 2 and the inward propagating 
Langmuir wave be denoted by subscript 1. Assume that the laser wavelength is 
much shorter than the characteristic density scale length. Show that if the back- 
ward scattered electromagnetic wave has a frequency only slightly above wpe 
then it is possible to satisfy the frequency and wavenumber matching conditions 
at a location where the density is approximately 1/4 of the density where the 
incident wave would reflect. 


Draw a sketch of w versus k for the incident em wave, the backscattered em 
wave, and the Langmuir wave. Note that vre << cso that if this plot is scaled to 
show the dispersion of the electromagnetic waves, the Langmuir wave dispersion 
is almost a horizontal line. 


Draw vectors on the sketch in (a) with coordinates {k, w} so that the incident 
electromagnetic wave is a vector v3 = {k3, w3}, the backscattered wave is a 
vector V2 = {k2, w2} and the Langmuir wave is a vector vı = {k,,w 1}. Show 
on the sketch how the vectors can add up in a manner consistent with the selec- 
tion rules v3 = vı + vo. 


3. Parametric decay instability: The minimum pump amplitude for the parametric decay 
instability (electromagnetic wave decays into a Langmuir wave and an ion acoustic 
wave) is given by Eq.(15.119) to be 


VwywoaliTs 


Ez = 4 x 


where the coupling parameter was defined in Eq.(15.81) to be 


(a) 


(b) 


We qkı 
yMimewzw3 
How do the Langmuir wave frequency wy and the ion acoustic wave frequency 
wW 1 Compare to wpe (nearly same, much larger, or much smaller)? 


N= 


Taking into account the selection rules, how does the pump frequency w3 com- 
pare to wpe? What does this imply for kg and hence kı and k2? 


(c) 


(d) 


(e) 


15.7 Assignments 459 


Using the results from (a) express the instability threshold in a form where 
E0 E? /nKT, is a function of Py /w, and Tz /w2. 


Suppose a pump wave with frequency w3 propagates into a plasma with a monoton- 
ically increasing density. Show the relative locations of stimulated Brillouin and 
stimulated Raman backscattering in terms of a criterion for the local density. 
Assume that the wavelengths of the waves are much shorter than the density 
gradient scale length. 


The National Ignition Facility at the Lawrence Livermore National Lab in the 
USA, nearing completion in 2004, has an array of 351 nm wavelength lasers 
which are designed to apply 5 x 1014 watts over the surface of a target with 
initial radius of 1 mm. If these lasers are considered the pump electromagnetic 
wave with frequency w3, at what density does w3 = Wye occur. Assuming 
T, = 300 eV, what is the value of co £?/nkT, at the location where wg = Wye? 
If ly /wy = T2/w2 = 0.1 will this plasma be susceptible to the parametric decay 
instability? 
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Non-neutral plasmas 


16.1 Introduction 


Conventional plasmas, the main topic of this book, consist of a quasi-neutral collection 
of mutually interacting ions and electrons. This description applies to the vast majority 
of entities called plasma (e.g., fusion, industrial, propulsion, ionospheric, magnetospheric, 
interplanetary, solar, and astrophysical plasmas). Non-neutral plasmas are an exception 
to this taxonomy and not surprisingly, have certain behaviors which differ from conven- 
tional plasmas (Malmberg 1992, Davidson 2001). Applications of non-neutral plasmas 
include microwave oscillators and amplifiers, simulating the vortex dynamics of ideal two- 
dimensional hydrodynamics, testing basic non-linear concepts, and confining anti-particles. 


16.2 Brillouin flow 


Using a cylindrical coordinate system {r,0, z} we first consider the forces acting on an 
infinitely long, azimuthally symmetric cylindrical cloud of cold charged particles all having 
the same polarity. Because of the mutual electrostatic repulsion of the same-sign charged 
particles, the cloud of particles will expand continuously in the r direction and so will 
not be in radial equilibrium. The cloud would be even further from equilibrium if, in 
addition, it were to rotate with azimuthal velocity u = ug0, because rotation would provide 
a centrifugal force mu? /r which would add to the radially outwards electrostatic force and 
so increase the rate of radially outward expansion. 

However, if this rotating cloud of same-sign charged particles were immersed in a uni- 
form axial magnetic field B = B2, then the magnetic force ~ qu x B would push the 
charged particles radially inwards, counteracting both the electrostatic repulsion and the 
centrifugal force. This situation is shown in Fig.16.1 for the situation where the cloud con- 
sists of electrons (Malmberg and de Grassie 1975). Negatively biased electrodes are used 
at the ends to prevent cloud expansion in the z direction; if the cloud were made of ions, 
then the end electrodes would be biased positively. 


460 


16.2 Brillouin flow 461 


coils to make axial magnetic field B 


negatively negatively 
biased cylindrical electron cloud biased 


electrode ~a perfectly conducting wall electrode 


Figure 16.1: Pure electron plasma configuration. Magnetic field B = Bz produced by coils, 
electrodes on ends prevent axial expansion. 


Because there is only one charge species, there is no frictional drag due to collisions 
with a species of opposite polarity, and because the plasma is cold, the pressure is zero. 
The radial component of the fluid equation of motion Eq.(2.27) thus reduces to a simple 
competition between the electrostatic, magnetic, and centrifugal forces, namely 


mug 


0=q(E,+uBz) + (16.1) 


Because of the assumed cylindrical and azimuthal symmetry, Poisson’s equation reduces 
to 


10d n(r)q 
-—(rE,) = 16.2 
r Or Ge) £0 Cea) 
which can be integrated to give 
1 r ia 
pots 1 n(r')r'dr’. (16.3) 
€or 0 


In the special case of uniform density up to the plasma radius rp, which by assumption is 
less than the wall radius a , Eq.(16.3) may be evaluated to give 


A i forr < rp 
2£0 
Ep = r2 (16.4) 


so that inside the plasma Eq.(16.1) becomes 


22 
Wr 


2 
This is a quadratic equation for ug and it is convenient to express the two roots in terms of 


angular velocities wo = ug/T so 
—We + 4/ w? — Qw2 
: (16.6) 


2 
Since wo is independent of r, the cloud rotates as a rigid body; this is a special case resulting 
from the assumption of a uniform density profile (in the more general case of a non-uniform 
density profile which will be discussed later, the rotation velocity is sheared so that wo is a 
function of r). The two roots in Eq.(16.6) coalesce at w? = w2 /2; this point of coalescence 


uZ + ugrwe + =0. (16.5) 


Wo = 
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is called the and it is seen that real roots wo exist only if the density is sufficiently low for 
we to be below this limit. 

We will consider the situation, typical for non-neutral plasma experiments, where the 
density is well below the Brillouin limit so that the two roots are well-separated and given 
by 


—we (1 +w /2w2) (16.7) 
Wor 5 w? /2We; (16.8) 


the plus and minus signs refer to the choice of signs in Eq.(16.6). The large root wo_ is near 
the cyclotron frequency and the small root wo4 is much smaller than the plasma frequency 
since wp << wec. The small root is called the diocotron frequency. 

The rotation in the magnetic field provides an inward force balancing the outward radial 
electrostatic force of the non-neutral plasma. There also exist axial electrostatic forces 
due to the mutual electrostatic repulsion between the same-sign charges and these forces 
would cause the plasma to expand axially. Since these axial forces cannot be balanced 
magnetically, the axial forces are balanced by biased electrodes at the ends of the plasma. 
The electrodes have the same polarity as the plasma, thereby providing a potential well in 
the axial direction. Any particle which tries to escape axially is repelled by forces due to 
the repulsive bias of the end electrode and reflects before reaching the end electrode. 

The fast motion of the charged particles in the axial direction smears out axial structure 
so that the non-neutral plasma can be considered axially uniform to first approximation. 
A pair of deceptively simple-looking coupled equations relating the electrostatic potential 
ġ and the density n govern not only the equilibrium but also the surprisingly rich low- 
frequency dynamics of a non-neutral plasma. An important feature of these equations is 
that the x B drift 


I? 


Wo 


-V¢oxB 
i = (16.9) 
describes an incompressible flow, so that when this drift is inserted into the continuity 


equation 
on 
OE +u-Vn+nV-u=0, (16.10) 
the V - u term vanishes and the density time dependence results from convection only, so 
that 
On Vo xB 


B2 


Vn. (16.11) 


ot 
Equation (16.11) and Poisson’s equation 


nq 
E0 


V? = — (16.12) 
provide two coupled equations in n and @ and are the governing equations for the configu- 
ration. Although Eqs.(16.11) and (16.12) appear simple, they actually allow quite complex 
behavior which will be discussed in the remainder of this chapter. An interesting feature 
of these equations is that they have no mass dependence, a consequence of ignoring the 
centrifugal force term which affects only the high frequency branch, Eq.(16.7). Thus, the 
low frequency dynamics described by Eqs.(16.11) and (16.12) can be thought of as the dy- 
namics of a massless, incompressible fluid governed by a combination of Æ x B drifts and 
Poisson’s equation. 
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16.3 Isomorphism to incompressible 2D hydrodynamics 


Consider now the equations governing an incompressible two-dimensional fluid with ve- 
locity u =u, (r, 0) +uo (r, 0)0. Incompressibility means that the mass density p is constant 
and uniform so that the continuity equation reduces to 


V-u=0. (16.13) 


Using the vector identity u-Vu = Vu?/2 — ux Vx u the fluid equation of motion can be 
written as 


ð 1 
p = + 5Vu? -uxVxu) = -VP. (16.14) 
Taking the curl of this equation and defining the vorticity vector Q =V x u gives 
on 
‘ape V x (ux Q) (16.15) 


which has the same form as the ideal MHD induction equation, Eq.(2.81), and so can be 
interpreted as indicating that the vorticity being frozen into the convecting fluid (Kelvin 
vorticity theorem). Because 


{190 1 Ou, 
vxu= (ii (rug) z a (16.16) 
the vorticity vector lies in the z direction and may be written as 
2 =02. (16.17) 
The z component of Eq.(16.15) is 
on 
oan z2-V x (uxZ) 
= V-((uxzZ) x Zz) 
= —V (uQ) 
= uVe. (16.18) 


The incompressibility condition Eq.(16.13) means that the velocity can be written in terms 
of a stream-function 4, 


u=—-VW xz (16.19) 
so Eq.(16.18) can be written as 
OQ 
aE Vy x2- VQ. (16.20) 


Furthermore, the z component of the curl of Eq.(16.19) can also be expressed in terms of 
w since 


Q z-Vxu 


I 

| 
< 
< 
© 

x 
& 

x 
& 


= Vy. (16.21) 
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There is thus an exact isomorphism between the non-neutral plasma equations and the 
equations describing a 2D incompressible fluid; this isomorphism is tabulated below: 


Non-neutral plasma Incompressible 2D fluid 
on Vox ĉ on ats 
Vo= — Vy =2 

E0 


These sets of equations can be made identical by setting 


-2 o=- 


OSS (16.22) 


Thus density corresponds to vorticity and electrostatic potential corresponds to stream- 
function. 

A non-neutral plasma can therefore be used as an analog computer for investigating the 
behavior of an inviscid incompressible 2D fluid. This is quite useful because it is difficult 
to make a real fluid act in a truly two dimensional inviscid fashion whereas it is relatively 
easy to make a non-neutral plasma. While real fluids are three dimensional, it is neverthe- 
less important to understand 2D dynamics since this understanding can be of considerable 
help for understanding three dimensional dynamics. The plasma analog contains all the 
nonlinearities of vortex interactions that characterize the 2D fluid problem. Besides serv- 
ing as an analog computer for investigations of 2D fluid dynamics, non-neutral plasmas 
have also been successfully used as a method for trapping antimatter (Surko, Leventhal and 
Passner 1989). 


16.4 Near perfect confinement 


An curious feature of non-neutral plasmas is that collisions do not degrade confinement 
so long as axisymmetry is maintained (O’Neil 1995). Thus, in a pure electron plasma, 
electron-electron collisions do not cause leakage of the plasma out of the trap (loss of 
confinement comes only from collisions with neutrals and this can be minimized by using 
good vacuum techniques). To see this, consider the canonical angular momentum of the 
itt particle 

Po; = mrjvo; + qri Aoi- (16.23) 
A collision between two identical charged particles will conserve the total angular momen- 
tum of the two particles and so the total angular momentum of all the particles is 


N N 
SS Poi = ` (mrjve, + qriAoi) = const. (16.24) 

i=1 i=1 
even when there are electron-electron collisions. The vector potential for the magnetic field 
B = Bzis A = 6Br/2, and if the magnetic field is sufficiently strong, the inertial term 
in Eq.(16.24) can be dropped, so that conservation of total canonical angular momentum 
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reduces to the simple relationship 


N B N 
Py =~ Poi = a Sr? = const. (16.25) 
i=1 i=1 


Equation (16.25) constrains the plasma from moving radially outwards in an axisymmetric 
fashion. Thus, interparticle collisions cannot make the plasma diffuse to the wall and so a 
collisional plasma is perfectly confined so long as axisymmetry is maintained. 

An alternative interpretation can be developed by considering how collisions cause ax- 
isymmetric outward diffusion in a conventional (i.e., electron-ion) plasma. The effect of 
collisions in this situation can be seen by considering the steady-state azimuthal component 
of the resistive MHD Ohm’s law. This azimuthal component is 


—U,B =nJo = ee ne(uig — ug) = Hieber (uig — uo) (16.26) 
ne e 


and shows that axisymmetric radial flow to the wall results from collisions between unlike 
particles since 

(uio e ug). (16.27) 
Thus, radial transport requires momentum exchange between unlike species and this ex- 
change occurs at a rate dictated by the collision frequency ve; and by the difference be- 
tween electron and ion azimuthal velocities. If only one species exists, it is clearly impos- 
sible for momentum to be exchanged between unlike species and so there cannot be a net 
radial motion. A practical consequence of this result is that electron confinement in pure 
electron plasmas is orders of magnitude larger than confinement in quasi-neutral plasmas 
(hours/weeks compared to microseconds/seconds). 


16.5 Diocotron modes 


Non-neutral plasmas support linear waves that differ from those of a quasi-neutral plasma 
(Gould 1995). The theory of low frequency linear waves in cylindrical non-neutral plasmas 
can be developed by linearizing the continuity equation Eq.(16.11) to obtain 


oni + uo: Vni + ui Vno = 0. (16.28) 
Using Eq.(16.9) to give ug, u, and Poisson’s equation Eq.(16.12) to give no, nı results in 
the wave equation 
2 
YA Yi = Bids Tans - YV? = 0. (16.29) 
Because of the cylindrical geometry, it is convenient to decompose the perturbed poten- 
tial into azimuthal Fourier modes 


gilr, 0, t) = S Alr the”? (16.30) 


where 


27 
int) =5- f dog (r, 0, the. (16.31) 
0 
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The 7t” azimuthal mode is assumed to have a time dependence exp(—iw t) so that 


by (r,0,t) = X dj (rel eet 
l 


and the Fourier coefficient is given by 


2T 


Using the equilibrium azimuthal velocity 


2T 
a= Vol oneta 
0 


-Vø xB ; 
ueo(r) = ans 0 


the temporal-azimuthal Fourier transform of Eq.(16.29) can be written as 


’ lugo(r) 2% lo, d 2 a 
(«- 7 Vo tinan do = 0. 


Using the relations 


te 1d [dé [2 ~ 
y2 = 1 

bı rdr | dr ) r2 bı 
1d (3000 
rdr dr 


this temporal-azimuthal Fourier transform of Eq.(16.29) can be expanded as 


(« nw) ( d (2) Sa) ib, d (: d (e) = 
r rdr dr r2 rBdr \rdr dr 


The equilibrium angular velocity is 


V7 ho 


ugo(T) 1 do 
wo(r) = r -rB dr 


and so Eq.(16.38) can be expressed as 


1d / d¢ P- ld, d /1d 
(a= dear) [: dr ( i) 5a) ! + dr G dr (u0) ) =n 


where the / subscript has been dropped from w. 


Since Eq.(16.37) gives 
d zai 
SPA | atian 
dr Eor Jo 


(16.32) 


(16.33) 


(16.34) 


(16.35) 


(16.36) 


(16.37) 


(16.38) 


(16.39) 


(16.40) 


(16.41) 


the equilibrium angular velocity can be evaluated in terms of the equilibrium density to 


obtain 


wo(r) = a no(r')r'dr’. 
0 


egBr? 


(16.42) 
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Equation (16.42) reduces to the special case of rigid body rotation when the density is 
uniform, but in the more general case there is a shear in the angular velocity. It is seen from 
Eq.(16.39) that 


Led ne 1 d / d nq w?(r) 
aR Py, = = = : 16.4 
r dr (r wo(r)) Br dr (r dr ) E9B We ee) 
Furthermore, since ; 
d {1dr 
pang (peepee Weg 16.44 
dr \r =) aie) 


an arbitrary constant can be added to wo(r) in the last term in Eq.(16.40), which conse- 
quently can be written as 


d [ dé HE ~d fld 
[w — lwo(r)] (3 ( i) =a biz Gå [r? (w tn) =0; (1645) 


this is called the diocotron wave equation. 
The dioctron wave equation can be written in a more symmetric form by defining 


$ = ro 
G = r?(w-— lwolr)) (16.46) 
and using 
da f di\ d(/1ldb)\, 2 
dr ( i) mies G dr ) ' r2 cs?) 
as d /1d® d /1dG 
i 2 Tr 7 _ 
ae G z) + (1-2?) ta G =) =0. (16.48) 


16.5.1 Wall boundary condition 


While the wall boundary condition does not affect the equilibrium of a non-neutral plasma, 
it plays a critical role for the diocotron modes since these modes are non-axisymmetric. Be- 
cause the wall is perfectly conducting, the azimuthal electric field must vanish at the wall, 
i.e., Eg(a) = 0. This is trivially satisfied for the equilibrium because the equilibrium is ax- 
isymmetric and Eg = —r~ 184/30 vanishes everywhere for an axisymmetric field. How- 
ever, for a non-axisymmetric perturbation, the azimuthal electric field is Een = —ild, /r 
which is, in general, finite. Thus, in order to have the azimuthal electric field vanish at the 
wall, each finite / mode must satisfy the wall boundary condition h(a) =0. 


16.5.2 The / = 1 diocotron mode: a special case 


The / = 1 mode is a special case and has the exact solution 


® = AG (16.49) 
where A is an arbitrary constant. In terms of the original variables, this solution is (White, 
Malmberg and Driscoll 1982) 


Jlr) = Hol) (16.50) 
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where S is a constant. Since Eq.(16.48) is a second-order ordinary differential equation, it 
must have two independent solutions and Eq.(16.50) must be one of these solutions. The 
other solution has a singularity at r = O and is discarded as being non-physical (this other 
solution can be found by assuming it is of the form © = FG and substituting this into 
Eq.(16.48) to find F). It is now evident that rigid body rotation corresponds to having 
G = 0 for all r since wo(r) is a constant and w = wo(a) for a rigid body; if G = 0 then 
Eq.(16.48) shows that the form of ® is unrestricted and so rigid body rotation can have 
any perturbation profile. On the other hand, if the rotation has even the slightest amount of 
shear, then G' must be non-zero and the solution is restricted to be of the form specified by 
Eq.(16.50). 

The perfectly conducting wall boundary condition Hg = O implies that y= (a) = 0 
and so Eq.(16.50) gives the l = 1 mode frequency to be 


w = wola) 
a “panes Saar 
coBa? Jo 
E, (a) 


which has the interesting feature of depending only on E, (a), the equilibrium radial field 
at the wall. Since £,(a) depends only on the total charge per length, the 1 = 1 mode 
frequency depends only on the total charge per length and not on the radial profile of the 
charge density. The coefficient S has been chosen so that the perturbed radial electric field 


at the wall is 
~ _ Saf dwo(r) 
E, (a) = E ( ay on (16.52) 


Since no(a) = 0 by assumption, Eq.(16.42) shows that 


Sar). = Ce) 


2 
Se (16.53) 
a 
and so 
S=E,(a). (16.54) 
16.5.3 Energy analysis of diocotron modes 
The linearized current density of a diocotron mode is 
Jı = mquo+ noquy 
> E, xB 
= mqrwolr)ð + nog Ee (16.55) 


B2 


and the nı qrwo(r)ĝ equilibrium flow term enables the wave energy to become negative. 
The wave is electrostatic so Eq.(7.56) gives the wave energy density to be (Briggs, Daugh- 
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erty and Levy 1970) 


t OE 
w [ at(Br (ata) 


t 
amf dt (E1-J1) . (16.56) 


Unlike the uniform plasma analysis in Section 7.4, here both plasma non-uniformity and 
wall boundary conditions must be taken into account. The change in system energy W due 
to establishment of the diocotron wave is found by integrating w over volume and is 


t 


wW fë (2 (V -6 Vb, — 619761) +f 


— co 


dt nigrwo(r) E16 ) 


1 t 
= afar (sme + rwo(r) Í dt nı Eio ) (16.57) 
where Eq.(16.55) and the perfectly conducting wall boundary condition ¢,(a) = 0 have 
been used. Using Eq.(7.62) and (16.32), the energy for a given / mode can be written as 


1_ ~x t ~ 
W, = 4 Re J. dr (Si vit + rwo(r) if dt iu Eig ene) (16.58) 
In order to evaluate this expression, it is necessary to express all fluctuating quantities 

in terms of the oscillating potential ,. Using Poisson’s equation in Eq.(16.35) it is seen 
that 

a dy dno 

= 16.59 

ot rB(w—lwo(r)) dr ( ) 

and the complex conjugate of the linearized azimuthal electric field is 


r aN toy 
Ei, = (=) DON (16.60) 
r r 
The wave energy is thus 
~ (2 
a| dno Atut 
moa e I dr ane =o dE 
2B il /4,| et an 
+rwo(r) fi dt 2 
=%  r?(w— lwoọ(r)) dr 
e2wit 
2m a aT. -a Sa 
By eae ard “ap Ww- lwo) ano] al 
4B Jo 0 +wo(r) fÉ ile dt dr ee 
Pi = (w — lwolr)) 


(16.61) 
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where L is the axial length of the plasma. The second term above can be directly evaluated 
as 


RE 
i 2w;t : l 
Re : dt sl aa =, Re £ i 2il(wr — lwo(r) — iwi) > T ati 
S (w — lwo(r)) eg (wr — lwo(r))? + w? l 
$ 
12w; po 2 
= dt i pai J 
he (wr — lug (r))? +w? dl )| e 
d g j 2w;t 
aard ë és 
(wr — lug (r))? + w? |"! 
l ~ 12 
(wr — lwo(r))? hı) as w; + 0 (16.62) 


Thus, in the limit w; — 0, 


qL f” 4 1 1 wo(r) dno j~ |2 

-5f ao f a |( ey " Tw — lwo(r))? ral 
wea w dno, 2 

7 2B Jo "Tw — lwo(r))? dr 


For the / = 1 mode this can be evaluated using Eq. (16.50) to give 


wont $ no(r’)r'dr' 
E£oBa? Jo 


ôw, BS pa eC ce 
i 2B Jo (w — wo(r))? dr 


LS ff d 
= a J dr 22,2 
0 


OW, 


py 


— f a rno(r)dr (16.64) 
0 


where no(r) = O for rp < r < a has been used. Finally, using the middle line of 
Eq.(16.51) this becomes M 

OWia1 = ee (16.65) 
and so the l = 1 dioctron mode is seen to be a negative energy mode. This result can 
also be derived by considering the change in system energy which results when the non- 
neutral plasma is attracted to a fictitious image charge chosen to maintain the wall as an 
equipotential when the plasma moves off axis (see assignments). 

For 1 > 2, the negative energy nature of the diocotron mode can be understood by 
considering the change in the electrostatic energy stored in an isolated parallel plate 
capacitor when the distance between the parallel plates is varied. The stored energy is 
W = CV?/2 = Q?/2C and Q is constant because the capacitor is isolated. Increasing the 
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capacitance at constant Q reduces W, so bringing the plates together makes available free 
energy which could be used to move the plates even closer together. The system is there- 
fore unstable with respect to perturbations that tend to increase the capacitance. This is the 
electrostatic analog of the ideal magnetohydrodynamic flux-conserving situation where the 
energy is W = 7/20 and © is the magnetic flux linked by an isolated inductor L; any 
flux-conserving perturbation which increases inductance makes available free energy for 
driving an instability. 


If all the charge of a non-neutral plasma is assumed to be located at the radius r,,, 
then the combination of the non-neutral plasma and the wall at radius a is effectively 


a coaxial capacitor. Gauss’ law shows that the radial electric field is 2mregH, = A 
where A is the charge per length. The voltage difference between the plasma and the wall 
is therefore V = — f? E,dr = (2meo)~'In(a/rp) and the capacitance per length is 


C' = 2720 [In(a/ Pay Thus, increasing r, increases the capacitance and decreases the 
electrostatic energy associated with the fixed amount of charge. However, such an az- 
imuthally symmetric increase of rp is forbidden because it would rarefy the plasma, an 
impossibility since the only allowed motion is an E x B drift which is incompressible for 
an electrostatic electric field. 


The plasma can circumvent this constraint by undergoing an azimuthally periodic in- 
compressible motion that, on average, increases the capacitance. This could be arranged by 
splitting the system into an even number of equally spaced azimuthally periodic segments 
and arranging for an incompressible motion where even-numbered segments move towards 
the wall and odd-numbered segments move away from the wall. The volume between the 
wall and the plasma would thus be conserved so the system would be incompressible, but 
because capacitance scales as the inverse of the distance between a segment and the wall, 
the increase in capacitance due to the segments moving towards the wall would exceed the 
decrease of the segments moving away from the wall. Thus, the electrostatic energy of the 
system would decrease and free energy would be available to drive an instability. 


16.5.4 Resistive wall 


Let us now assume that the conducting wall has an insulated segment of length L, and 
angular extent A which is connected to ground via a small resistor R. The remainder of the 
wall is directly connected to ground; this is sketched in Fig.16.2. The wall is thus at or near 
ground potential and so there are no electric fields exterior to the wall, and in particular 
there is no radial electric field just outside the wall. 


In order for the radial electric field to vanish outside the wall, the wall must have a 
surface charge density that creates a radial electric field equal and opposite to the radial 
electric field produced by the plasma so 


—-S=—. (16.66) 
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The surface charge flows back and forth between the wall segments and ground. The charge 
on the segment extending from 0 = —A/2 to A/2 is 


A/2 
On = L J adda ,e! 
-4/2 
elA/2 _ ẹ-ilA/2 
il 
2 A 
= — Teos sin z (16.67) 
The electric current J flowing through the resistor is the rate of change of this surface 
charge, i.e., for the l = 1 mode this current is 


= L.a 0} 


I = 2iwL,aeoS sin (16.68) 


perfectly wall 
conducting pure segment 
wall electron 

plasma 


Figure 16.2: Arrangement for resistive wall instability of a non-neutral plasma. Resistor 
R is connected between ground and an isolated segment of azimuthal extent A and axial 
extent Ls. Remainder of wall is connected directly to ground. 


The rate at which energy is dissipated in the resistor is 


2 
(PRY = 5 (22Lsae08sin 5 ) R. (16.69) 


If the mode frequency is allowed to have a small imaginary part then the rate of change of 
electrostatic energy will be 
P= 2w,dW. (16.70) 
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Since the sum of the thermal energy in the resistor and the electrostatic energy in the plasma 
constitutes the total energy in the system, conservation of this sum gives 


2w,6W + (PR) =0 (16.71) 

or 
TE 16.72 
Si = SW G72) 


Because the mode has negative energy, w; is positive corresponding to instability. 
Substitution into the RHS gives 


A 
4w? RL2< 9 sin? A 
TL 


where L is the axial length of the wall and Ls < L is the axial length of the segment. 
This dependence of the growth rate on resistance has been observed in experiments (White 
et al. 1982). If the resistor R is replaced by a parallel resonant circuit, then the instability 
will occur at the resonant frequency of this circuit, because the effective resistance seen 
by the non-neutral plasma wall segment will be at a maximum at the resonant frequency. 
This is the basis of the magnetron tube used in radar transmitters and microwave ovens (see 
assignments). 


Wi = 


(16.73) 


16.5.5 Diocotron modes with | > 2 


The analysis of l > 2 diocotron modes resembles the Landau analysis of electron plasma 
waves but is not exactly the same. In general l > 2 diocotron modes must be considered 
using numerical methods because their behavior depends on the equilibrium density profile 
via coefficients of both the first and last terms in Eq.(16.38). However, some indication of 
the general behavior can be obtained analytically. An important condition can be obtained 
(Davidson 2001) by assuming that w = w, + iw; and writing Eq.(16.38) as 


1d dé 2- w, — lwolr))— iw; l nbq ~ 
y Qı Er ( o( )) 04 Pei. (16.74) 
rdr dr r (wr — lwo(r)) + wi rB € 


After multiplying through by rors integrating from r = 0 to r = a, and using the perfectly 
conducting boundary condition ¢;(a) = O when integrating the first term by parts, the 
following integral relation is found: 


2 


à d¢ Pie: (are h — iw; ee. ni 
[ ar gr] 4 fg)? 4 Heol) wi g PE oa | =o. 16.75) 
0 dr r (wr — lwo(r)) +w? rB & 
The imaginary part of this expression is 
~ 42 
l Tp Qi d 
agd J r : % es (16.76) 
corB Jo (wr — lwo(r)) +w? dr 


474 Chapter 16. Non-neutral plasmas 


the upper limit has been changed to r, because no(r) and dno/dr are by assumption zero 
in the region rp < r < a. If dno/dr has the same sign throughout the radial interval 
O < r < rp, then the integral would have to be non-zero since the integrand always has 
the same sign, and so w; would have to be zero. Thus, a necessary condition for w; to be 
finite is for dno /dr to change signs in the interval 0 < r < rp. This necessary condition 
corresponds to no(r) having a maximum in the interval 0 < r < rp. This sort of profile is 
commonly called hollow, because no(r) starts at some finite value at r = 0, increases to a 
maximum at some finite r < rp, and then decreases to zero at r = Tp. 

Further progress can be made by expressing the diocotron equations as a pair of coupled 
equations for the density and potential perturbations, i.e., 


Be. oad lo, dno 
Sa rB(w —lwo(r)) dr ew) 
1d / dé, [2 . hug 
ar 16.7 
rdr ( a) pe E0 sO?) 


Rather than substitute for ñ; in order to obtain Eq.(16.38), instead Eq.(16.78) is solved 
for h using a Green’s function method (Schecter, Dubin, Cass, Driscoll, Lansky and 
O’ Neil 2000). This approach has the virtue of imposing the perfectly conducting wall 
boundary condition on h at an earlier stage of the analysis before the entire wave equation 
is developed. The set of solutions to Eq.(16.78) are then effectively restricted to those satis- 
fying the wall boundary condition, and only this restricted set is used when later combining 
Eqs.(16.78) and (16.77) to form a wave equation. 

The Green’s function solution to Eq.(16.78) is obtained by first recasting Eq.(16.78) 


as 
g i ; 
a (2) Frie = ds ô(r — s)fu(s). (16.79) 
0 


r dr dr 
Thus, if w(s,7) is the solution of 


ld d Po 
Rar Gace s) = 307,38) = 00 = 8) (16.80) 
where 0 < s < a then 
o(r) = 1f dsy(r, s)ñ(s) (16.81) 
E0 0) 


is the solution of Eq.(16.78). The spatial boundary conditions are accounted for when 
solving Eq.(16.80) for the Green’s function w(r, s) and so are independent of the form of 
fy. 

Equation (16.80) is solved by finding separate solutions to its homogeneous counter- 


part 
ld /adab\ Ê 
b=0 16.82 

rdr (~ E) pe ( ) 


for the inner interval 0 < r < s and for the outer interval s < r < a and then appropriately 
matching these two distinct homogeneous solutions at r = s where they meet. The inner 
solution must satisfy the regularity condition 7)(0) = 0 and the outer solution must satisfy 
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the wall boundary condition 7(a) = 0. Since the solutions of Eq.(16.82) are Y ~ r*!, the 
inner solution must be 


l 
We A forO< r<s (16.83) 


and the outer solution must be 


v=8( (2) - ay fors<r<a (16.84) 


where the coefficients œ and 8 are to be determined. 
Integration of Eq.(16.80) across the delta function from r = s_ to r = s 4 gives the 
jump condition 
d 
BR s) S =-1 (16.85) 


and integrating a second time shows that w must be continuous at r = s. These jump and 
continuity conditions give two coupled equations in œ and ĝ, 


ONAT- = aw 
(O-O) -0 a 

Solving for a and gives the Green’s function, 
ened QAO or a 


Ga 0 as 


this satisfies Eq.(16.80) and also the boundary conditions at r = O and r = a. Using the 
Green’s function in Eq.(16.81) gives 


itr) = 2] aswe sil) f asue sit) | 


zel TAO EO 
) 


= 16.89) 
2l £o m (Z l ye o r 
d 
+f, ds ( 5 ( = z iu (s) 
Finally, using Eq.(16.77) to substitute for fu (s) gives 
f a s! ( r! =) dls) dno 
D q o a \a ri} w-—lwo(s) ds (16.90) 


~ 20B p/s a o(s) dno 
ds 
r @\ a sl} w—lwo(s) ds 
This integral equation for ,(r) not only prescribes the mode dynamics, but also explic- 
itly incorporates the spatial boundary conditions. The resonant denominators w — lwo(s) in 
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the s integrals are reminiscent of the resonant denominators occurring in the velocity space 
integrals of the Landau problem. Stability depends on the sign of dno/ds at the location 
where w = Iwo(s) in analogy to the dependence of Landau stability on the sign of d fo /du 
where v = w/k. Because these equations are isomorphic to the 2D incompressible hy- 
drodynamic equations, it is seen that phenomena similar to Landau damping or instability 
should occur in 2D incompressible hydrodynamics. 


16.5.6 Phase mixing and relation to Landau damping 


The behavior of a localized infinitesimal bump or patch of increased density which is ini- 
tially off-center provides insight into vortex dynamics. If the equilibrium angular velocity 
is sheared, then the patch will become sheared because the inner and outer portions of the 
patch will rotate at different angular velocities. Thus the angular position of different radial 
positions of the patch will have trajectories scaling as 6(r) = two(r). The angular separa- 
tion between two points in the patch starting at respective radii r and r + ôr will scale 
as 60 = térdwo/dr and this separation will eventually exceed 27 at sufficiently large t. 
This gives a sort of spatial phase mixing (Gould 1995), because a localized density patch of 
increased density will eventually be stretched out to become a multi-turn thin spiral of in- 
creased density. The number of turns in the spiral increases linearly with time. The patch is 
thus smeared out over all angles and so is no longer azimuthally localized. Since the patch 
is stretched azimuthally and yet is incompressible, the thickness of each turn in the spi- 
ral must decrease as the number of turns in the spiral increases. Specifically, the length of 
the spiral increases with t and the radial thickness of each turn decreases as 1/t so that the 
area remains constant. A graphic demonstration of this stretching and thinning has been 
obtained by Bachman and Gould (1996) by direct numerical integration of the dynamical 
equations with an initial condition consisting of a prescribed density patch. Since the de- 
cay and eventual disappearance of the patch as it deforms into a nearly infinitely long and 
nearly infinitely thin spiral is a spatial analog to the velocity space phase mixing underly- 
ing Landau damping, it might be expected that the non-linear mixing of two such patches 
initiated at two successive times might give rise to a spatial echo effect (Gould 1995). This 
spatial echo effect involving the nonlinear beating of two spatial spirals has been seen ex- 
perimentally (Yu and Driscoll 2002). In this experiment an l = 2 perturbation is applied 
and then decays as it is stretched into a nearly infinitely long, nearly infinitely thin spiral. 
Then an / = 4 perturbation is applied which similarly decays. Finally, after both of the per- 
turbations have decayed a nonlinear l = 2 echo is observed because of nonlinear mixing of 
the two spirals associated with the respective initial perturbations. 

If the patch has a finite amplitude, then its self-electric field will affect the dynamics. 
This is the non-linear regime and will cause a sigmoidal curling of the patch since azimuthal 
electric fields associated with the patch will give radial motions in addition to the azimuthal 
motions. 


16.6 Assignments 


1. Relationship between non-neutral and conventional plasma equilibria: 


Compare the following two plasmas, both of which have cylindrical symmetry, axial 
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uniformity, the same radial electron density profile ne(r), and are in equilibrium: 


(a) a cylindrical pure electron plasma immersed in a strong magnetic field B =B2 
where wee << w? 


(b) an unmagnetized quasi-neutral electron-ion plasma which has infinitely massive 
ions. 


What is the ion density in the quasi-neutral plasma and what electric field is produced 
by this ion density? How does the force associated with the electric field produced by 
the ions in the quasi-neutral plasma compare to the magnetic force associated with 
electron rotation (finite ueg) in the pure electron plasma? What is the net equilibrium 
force on the electrons in the two cases? Assuming that the electrons have zero mass, 
which plasma has more free energy? 


. Free energy associated with sheared velocity profile in slab approximation: Suppose 
a non-neutral plasma does not rotate as a rigid body and instead has a sheared angular 
velocity. An observer rotating with the plasma at some radius robs would conclude that 
the plasma has positive angular velocity for r > robs and a negative angular velocity 
for r < Tops. Since the rotational velocity is due to an E x B drift, this means that 
the effective radial electric field measured in the observer frame must change sign at 
rT = Tobs- 


(a) Show that this means that there is an effective charge sheet at r = fobs. 


(b) Consider the limiting situation where the above situation is modeled using slab 
geometry so that r — x and 0 — y, B =B2 and the non-neutral plasma initially 
centered about x = 0, has width w and a uniform density n for |x| < w/2. This 
equilibrium is sketched in Fig.16.3. There is vacuum in the region between the 
plasma and perfectly conducting walls at x = +a where a > w/2. What are the 
electric field and the plasma flow velocity for x > 0 and for x < 0? 


(c) Now suppose that the location of the non-neutral plasma charge sheet is per- 
turbed so that instead of being centered at x = 0, the location of the center 
becomes a function of y and is given by x = A(y) = A cos ky where kd << 1 
so that the wavelength in the y direction is very long. What is the boundary 
condition on /, and hence on ¢ at the walls? 
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Figure 16.3: Left: equilibrium configuration for non-neutral plasma of width w, centered 
between pair of perfectly conducting walls at v = +a. Right: perturbed configuration 
where position of x-midpoint of plasma is located at A = A cos ky. 


(d) Since there is vacuum between the plasma and the wall and kd << 1, what is 


(e) 


(f) 


the limiting form of Poisson’s equation in the regions between the plasma and 
the walls? 


Assume that w is small, so that the non-neutral plasma can be approximated 
as being a thin sheet of charge. Being perfectly conducting, the walls must be 
equipotentials, and so without loss of generality, this potential can be defined to 
be zero. Show that the potential to the right of the charge sheet must be of the 
form 


p, = a(a— x) (16.91) 
and the potential to the left of the charge sheet must be of the form 
dı = pla + x) (16.92) 


where œ and 8 are coefficients to be determined by considering the jump in 
electric field at the charge sheet. 


Taking into account the jump in the potential at the charge sheet and the continu- 
ity in the potential at the charge sheet, solve for œ and 8. Assume that w << a 
consistent with the assumption that the non-neutral plasma can be considered as 
a thin charge sheet. Which of ER, E? is larger? 
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(g) Suppose that the length of the plasma in the y direction is L where L is an 
integral multiple of 27 /k, the wavelength in the y direction. By calculating EF, 
to the right of the charge sheet and also to the left of the charge sheet, show that 
the energy stored in the electrostatic electric field is approximately 


where hA is the length in the z direction. How does W change when A is in- 
creased? What does this suggest about the stability of sheared velocity profiles 
with respect to perturbations as prescribed in (c)? 


3. Image charge in cylindrical geometry: Two lines charges A; and Az are aligned along 
the z axis and located at rı = x, % and r2 = x2% as shown in Fig.16.4. What values 
of x2 and Az are required in order for the cylindrical surface \/x? +y? = a to be 
an equipotential as would be required in order to have a perfectly conducting wall at 
r = a? The following hints should be useful: 


(a) Define a cylindrical coordinate system r,0,z so that the cylindrical surface is 
given by r =af. Determine the potential on the cylindrical surface r = a. 

(b) Show that |?— #2, /al = |fa/x2— ĉ| if a/xg = 21 /a. 

(c) Show that A2 = — A; is required in order for the cylindrical surface r = a to be 
an equipotential. 


Ao 


AS 


perfectly 
conducting 
wall 


Figure 16.4: Line charge A; is displaced a distance xı from axis of perfectly conducting 
cylindrical wall of radius a. An image charge A2 is located at x2. 


4. Diocotron mode energy using image charge method: A cylindrical non-neutral plasma 
is translationally invariant in the z direction and is bounded by a perfectly conducting 
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wall at r = a. The plasma has an equilibrium density profile n = n(r) forO < r < rp 
and n = 0 for rp < r < aso there is a finite-size vacuum region between the edge 
of the plasma r, and the wall radius a. Calculate the change in system energy if the 
plasma axis is shifted off the wall axis and compare this result to Eq.(16.65). Hints 
are given below: 


(a) Show that the electric field outside the plasma is the same as the electric field of 
a line charge located on the plasma axis. 


(b) Suppose that x denotes the distance the plasma is shifted off the wall axis. What 
is the location and strength of the image charge required for the wall to be an 
equipotential? Is the force between the plasma and the image charge attractive 
or repulsive. How does the image charge move as increases from zero to some 
finite value? 


(c) Calculate the force on the image charge. What is the force on the plasma? What 
work is done by the plasma in order to be displaced by a finite amount x? Ex- 
press this work in terms of the radial electric field perturbation at the wall. 


(a) (b) (c) 
Kd +d aa plates bd + 7 
= | Lt 
Q ae Ve | | 
| | || 
| | 
C ed he 
ee | i 
z fs |2@ A 
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Figure 16.5: 


5. Negative energy in terms of two capacitors with fixed charge: Consider two situations 
where a charge Q is stored on the two parallel plate capacitors Ci and Cə that de- 
form in such a way as to conserve total system volume; these situations are shown in 
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Fig.16.5(a) and (b). 


(a) Here the center plate is displaced off center by an amount z so that the volume 
occupied by the two capacitors is conserved. What is the change in the system 
energy when the center plate is displaced? Assume that the value of each capac- 
itor is C1,2 = €9A/di,2 where dı,2 is the distance between the parallel plates 
and initially d4 = d2. 


(b 


wm 


Show that the configuration in Fig.16.5(b) is electrically equivalent to the con- 
figuration in Fig.16.5(a). Discuss how the system shown in Fig.16.5(c) would 
relate to an l = 4 diocotron mode (recall that any perturbations to the plasma 
must be incompressible). 


6. Magnetrons as a generalization of non-neutral plasmas undergoing a resistive wall 
instability: The magnetron vacuum tube used in radar transmitters and in microwave 
ovens can be considered as a non-neutral plasma undergoing a negative energy dio- 
cotron instability. These tubes are simple, rugged, and efficient. The relation between 
the non-neutral plasma resistive wall instability and the magnetron is shown in the se- 
quence of sketches Figs.16.6(a)-(d). The magnetron has a cylindrical geometry with 
an electron emitting filament (cathode) on the z axis, a segmented cylindrical wall 
(anode) and a z-directed magnetic field produced by permanent magnets. Instead of 
having a resistor R, connected across the gap of a segmented wall as in Fig.16.6(a), 
the magnetron has a set of cavities connected across the gap as in Fig.16.6(d). The 
cavities function as a resonant circuit and the output circuit loading provides an effec- 
tive resistance at the cavity resonant frequency. From the point of view of the plasma, 
the power appears to be dissipated in a resistor across the wall gap. However, since 
the cavity is coupled to an output waveguide, the power is actually transported away 
from the magnetron via a waveguide to some external location where the power is 
used to transmit a radar pulse or cook a meal. If the cavities in Fig.16.6(d) are phased 
0,7 then the ten cavities provide five complete azimuthal wave periods or l = 5. If 
the electrons have near Brillouin flow, and l = 5, what axial magnetic field should 
be used in order to have an output frequency f =2450 MHz, the frequency used in a 
home microwave oven. Why would the electron density increase to the point that the 
flow is nearly Brillouin, and why is the electron density not higher than this value? 
If the voltage drop between anode and cathode is 4 kV and the electron density is 
uniform, what nominal wall radius (distance between cathode and anode) should be 
used? Show that a parallel resonant circuit with a small resistance in series with the 
coil as 16.6(b) has an effective resistance which peaks at the resonant frequency; why 
does one want the effective resistance to peak at the resonant frequency? 


482 Chapter 16. Non-neutral plasmas 


(a) perfectly (b) perfectly 
conducting conducting 
wall wall 
— segment segment 

= ; 


(d) 


athode 


electron cloud 


inductance 


capacitance 


Figure 16.6: (a) Pure electron plasma with one wall segment connected to ground via a 
resistor R, (b) same, but now connected to ground via a resonant circuit consisting of 
a capacitor C' in parallel with a coil L and a resistor R, (c) same, but now a set of 10 
resonant circuits connected across 10 gaps, (d) same, but now the coil is a single turn and 
the capacitance is in the leads connecting to the coil (the resistor is the loading of the 
coupler to the output circuit, typically a loop inserted into a coil). 


7. Spiral due to sheared rotation: Suppose that a non-neutral plasma has a sheared angu- 
lar velocity so that wo = wo(r). Suppose that a radial line is painted onto the plasma 
at t = 0 extending from z = r,/4,y =Otox = r,/2, y = 0 where r, is the plasma 
radius. Suppose that wo = Wor/r,. Calculate the trajectory of points on the painted 
line and plot this for successive times using a computer. If the initial line had finite 
thickness and hence finite area, what would happen to the thickness of the line with 
increasing time? 
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Dusty plasmas 


17.1 Introduction 


Non-neutral plasmas had one less species than a conventional plasma (one species instead 
of two); dusty plasmas have one more species (three species instead of two). Not sur- 
prisingly, the addition of another species provides new freedoms which give rise to new 
behaviors. 

The third species in a dusty plasma, electrically charged dust grains, typically have a 
charge to mass ratio quite different from that of electrons or ions. Several methods for 
charging dust grains are possible. Electron bombardment is the usual means for charging 
laboratory dusty plasmas, but photoionization or radioactive decay could also be operative 
mechanisms and may be important for certain space and astrophysical situations. Photoion- 
ization would make dust grains positive because photoionization causes electrons to leave 
dust grains. Radioactive decay of dust grains would make the dust grains develop a polar- 
ity opposite to that of the particle emitted in the decay process, e.g., alpha particle emission 
by dust grains would make the dust grains negative. 

We shall consider here only the typical laboratory dusty plasma situation where the 
plasma is weakly ionized and dust grain charging is due to electron bombardment. Negative 
charging occurs because the electrons, being much lighter than the ions and usually much 
hotter, have a much larger thermal velocity than the ions. As a result, when a dust grain 
is inserted into the plasma, it is initially subject to a greater flux of impacting electrons 
than impacting ions, thereby causing the dust grain to become negatively charged. This 
negative charge eventually becomes sufficiently large to repel incident electrons and thus 
attenuate the incoming electron flux. On the other hand, the negative charge of the dust 
grain accelerates incident ions thereby increasing the ion flux to the dust grain. The net 
charge on the dust grain reaches equilibrium when the electron and ion fluxes intercepting 
the dust grain become equal. This is a dynamic equilibrium because it involves a continuous 
flow of plasma to the dust grain. Some kind of external source is thus required to replenish 
the plasma electrons and ions continuously, because otherwise all the plasma electrons and 
ions would eventually collect on the dust grain. The dust grain charging process is quite 
similar to the development of a floating potential on an insulated probe immersed in a 
plasma (see Eq.(2.124)). 
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`o grain A 


Figure 17.1: Electrons and ions may be considered as being collisionless inside a sphere of 
diameter lm fp surrounding a dust grain. The trajectory an ion making a grazing collision 
is shown; this ion has an impact parameter b. 


17.2 Electron and ion current flow to a dust grain 


Because an individual dust grain is much heavier than an electron or an ion, it can be 
approximated as an infinitely massive scattering center for the electrons and ions colliding 
with it. In typical dusty plasma laboratory experiments the collisional mean free path lm fp 
greatly exceeds the Debye length. This situation is sketched in Fig.17.1 where a dust grain 
with radius rg is surrounded by an imaginary sphere with diameter J,, 7, and the Debye 
radius is much smaller than lmfp. Electrons and ions are collisionless inside the lm fp 
sphere and so have Keplerian orbits associated with the electrostatic central force produced 
by the charge on the dust grain. Because the dust grain is shielded by other particles, 
the electrons and ions only experience the central force when inside a sphere having the 
nominal Debye radius. 

Dust grain charging involves calculating effective collision cross-sections for the elec- 
trons and ions colliding with the dust grain. A useful benchmark reference for this calcu- 
lation is the cross-section of charged particles colliding with a neutral dust grain. Because 
charged particles incident on a neutral dust grain have straight line trajectories, the cross- 
section of a neutral dust grain is just the geometrical area it projects onto a plane, i.e., 
O geometric = mea; The cross-section of a charged dust grain differs from geometric be- 
cause incident electrons and ions are deflected from straight line trajectories by the electro- 
static central force produced by the charge on the dust grain. The respective cross-sections 
for ions and electrons will now be calculated and related to o geometric- 

Fig.17.1 shows the impact parameter 6 and trajectory of an ion colliding with a nega- 
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tively charged dust grain; the corresponding trajectory of an electron would curl outwards 
instead of inwards and so the electron would require a much smaller impact parameter b 
in order to hit the dust grain. We define v to be the initial velocity of an incident particle 
having mass m and charge q, and define Vimpact as the velocity of this particle at the in- 
stant it makes a grazing impact with the dust grain. Using these definitions it is seen that 
conservation of angular momentum imposes the requirement 


Ub = Vimpact d (17.1) 
and conservation of energy impose the requirement 


1 1 
x = 57 Wimpact + qq (17.2) 


where @, is the potential on the surface of the dust grain. Figure 17.1 shows that b is larger 
than rg for an ion. Because the repulsive force between the negatively charged dust grain 
and an electron causes the trajectory of an electron to swerve in the opposite sense from an 
ion, b is smaller than rg for an electron; that is the electron has to be more ‘on-target’ than 
a neutral particle to hit the dust grain whereas an ion can be less ‘on-target’ than a neutral 
particle to hit the dust grain. 

Eliminating Vimpact between Eqs.(17.1) and (17.2) gives 


ene ae 
aw = 5mv*— z + qba (17.3) 


so the effective scattering cross-section is (Allen, AP and Reynolds 1957) 


: 2q@ 

2 d 
a(v) = 1b" = (1 = 2 O geometric- (17.4) 

mv 

For g@qz > 0 the interaction is repulsive and the cross-section is smaller than o geometric; 

whereas for g/, < O the interaction is attractive and the cross-section is larger than 

O geometric. The former case applies to electrons and the latter case applies to ions. 

The total current flowing to the dust grain for attractive interactions is 


EE E fo o(u)uf(v)4mv2d (175) 
0 


However, because in the repulsive situation incident particles having v < ,/2q@,/m are 
reflected and do not hit the dust grain, the repulsive situation cross-section is zero for all 
particles having v < ,/2qé,/m. Thus, the total current for repulsive interactions is 


Lp epulsive = af a(v ju fw )4rv7d (17.6) 
y 2q¢a/m 

Since the region outside the lm fp sphere sketched in Fig.17.1 extends to infinity, particles 

can be considered to have made many collisions before entering the lm fp sphere, and so the 

velocity distribution of particles incident upon the lm fp sphere will be Maxwellian, i.e., 


no v? /v2 
f) = TEE jvm (17.7) 
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where vr = ,/2KT/m is the thermal velocity. The incident particles will be collisionless 
as they travel inside the lm fp sphere. Integration of Eqs.(17.5) and (17.6) gives 


2 2 
Lattractive = af 74 (1 = ee uf (v)4rv7du 
0 mu 
jai X 2 
= 2an'rinoaer | (1- Ga ) e "rdr 
0 


2 qo 
= g1/2 (1 = 4.) NO QUT geometric (17.8) 


and 


ii qo 
Trepiisve = 271/2 rånogvr | (1- a ) e rdr 
dbal KT 


= 2 e7 ata/sT 


que 


NOQUTO geometric: (17.9) 


17.3 Dust charge 


As discussed above, a neutral dust grain inserted into a plasma will become negatively 
charged because vre >> vri. The electron current Te is thus a repulsive-type current and 
the ion current J; is an attractive-type current. As the dust grain becomes more negatively 
charged, |I.| decreases and |J;| increases until J; +I. = 0 at which time dust grain charging 
ceases. 

The presence of negatively charged dust grains affects the quasi-neutrality condition. 
Assuming singly charged ions and a charge of Z4 on the negatively charged dust grains, 
the quasineutrality condition generalizes to 


Nio = Neo + ZaNao- (17.10) 


Since the ion density is unaffected by the charging process, it is convenient to normalize all 
densities to the ion density and define 


Z 
pLa (17.11) 
Nio 
so that n 
L ey (17.12) 


is 
where neo, nio, and nao are the volne aversi electron, ion, and dust grain densities. 
Furthermore, if rg is small compared to the effective shielding length Aq, then the dust 
grain charge Z, is related to the dust grain surface potential and the dust grain radius rg via 
the vacuum Coulomb relationship, i.e., 


Zae exp(—ra/àa) 
ÅTEQT a 
Zae 


da = 


— ifra << àa- (17.13) 
Ateora 
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Equation (17.11) can then be written in terms of the dust grain surface potential as 


4 
gi oS (17.14) 
Nige 


It is now convenient to introduce the dimensionless variable 


pan (17.15) 
= ae ; 
so that œ becomes 
a = ATNaor aA; Va (17.16) 
where 
Ti 
E as (17.17) 


Nige 
is the ion Debye length. Large Y4 means that ion fall into a potential energy well much 
deeper than their thermal energy. 

It is also useful to introduce the Wigner-Seitz radius a, a measure of the nominal spacing 
between adjacent dust grains. This spacing is defined by dividing the total volume of the 
system V by the total number of dust grains N to find a nominal volume surrounding each 
dust grain. It is then imagined that this nominal volume is spherical with radius a so that 


vey (17.18) 
3 
in which case 3 
= 17.1 
nao = 73 (17.19) 
and 5 
3raà2; 
= Tadi pa (17.20) 


It is also convenient to normalize lengths to the ion Debye length so 


a= Pi, (17.21) 


where 
P = 35 (17.22) 
and the bar means normalized to an ion Debye length. 
The dust charge can also be expressed in a non-dimensional fashion using Eq.(17.13) 
to give 
Za 
=e 17.23 
AT nioà3; ana ( ) 
The floating condition J; + Ie = O shows that the equilibrium dust potential is given 
by 


O= niour, (1 = a ) — neovr, expleġa/ KT) (17.24) 
KT; 
which can be re-arranged as 
mMeTi i 
(+44) exp(YaTi/ Te) =1—a. (17.25) 


mM Te 
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However, using Eq.(17.21) this becomes 


MeTi / ; 
mi. exp(wqT;/ Te) = 1 — Pya, (17.26) 


(1 +44) 


a transcendental equation relating Y4 and P. 

Equation (17.26) shows that, for a given temperature ratio T; /Te and given mass ratio 
Me/m;, the normalized dust grain surface potential %4 is a function of P, a dimension- 
less parameter incorporating the geometrical information characterizing the dust grains in 
the dusty plasma. Because %4 appears non-al gebraically in Eq.(17.26), it is not possible to 
solve for 7)4(P) without resorting to numerical methods. However, solving the inverse rela- 
tion, namely P(2,), is straightforward because P occurs only once in Eq.(17.26). Solving 
for P gives (Havnes, Goertz, Morfill, Grun and Ip 1987) 


1 1 el; 
pa (1 + = Wi explYaT;/ Te). (17.27) 
Va Va mile 


log Wa 


Figure 17.2: Plots of log P and a v. logy, for Te = 1007; and T, = T;. Logarithms are 
base 10 and ion mass is 40 amu. Since P is proportional to the dust grain density the right 
hand side of these curves, i.e., small P, corresponds to the limit of low dust grain density. 
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The upper plot in Fig. 17.2 shows log P plotted versus log 44 for two cases correspond- 
ing to two typical laboratory configurations where dusty plasma experiments have been 
conducted; the lower plot shows the corresponding dependence of a on Y4. The Te = T; 
configuration corresponds to a potassium Q-machine plasma while the Te = 1007; con- 
figuration corresponds to an argon rf discharge. An ion of 40 atomic mass units has been 
assumed for both cases. Figure 17.2 shows that P varies inversely with 7, up to some 
critical value and then, for %4 above this critical value, P heads sharply to zero for mod- 
est increases in Y4. The lower plot shows that a is near unity to the left of the knee and 
then drops sharply to zero to the right of the knee. Comparison of the solid and dashed 
curves shows that the saturated value of %4 is an increasing function of Te/T;. The satura- 
tion value of 7), is 2.5 for the Q-machine parameters whereas the saturated value of 7), for 
the rf discharge is ~ 2 x 107. Since P is proportional to a~? and hence to the dust grain 
density, the right hand side of these plots (i.e., where P is small) correspond to the limit of 
small dust grain density. Also, comparison of the solid and dashed plots shows that 7, in- 
creases with Te /T;. Since Eq.(17.23) shows that Z4 is proportional to Y4, it is seen that Za 
also saturates on moving to the right in the plots and largest Z4 occurs at high T./T; and 
small dust grain density. 

The downward sloping segment on the left of Fig. 17.2 corresponds to the a ~ 1 
regime which is the situation where the dust grain density is large, nearly all the electrons 
are attached to the dust grains, and the plasma has almost no free electrons. On the other 
hand, the saturated limit of 7), on the right of Fig. 17.2 corresponds to the a << 1 regime 
which is where there is minimal depletion of free electrons, the dust grain density is low, 
and there is a very high charge on each of the relatively small number of dust grains. 

There are thus three regimes: 

1. The regime well to the left of the knee in the curves. Here a = 1 and nearly all 
electrons are attached to the dust grains. To the extent that a approaches unity, the 
dust grains replace the electrons as the negative charge carriers. 


2. The regime to the right of the knee in the curves. Here a << 1, most electrons are 
free, and the potential of an individual dust grain is very high and near at its saturation 
value (right hand side of plots). This regime corresponds to small P and a very small 
dust grain density. 


3. The regime in the vicinity of the knee in the curves. If T, >> T; then it is possible to 
have both a of order unity and %4 large even though w, is not quite at its saturation 
level. In this regime the majority of electrons reside on the dust grains, the dust grains 
are highly charged, and the dust grain density is appreciable. Crystallization of the 
dust grains can occur in this regime since crystallization requires a combination of 
high dust grain charge and small separation between dust grains. 
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Figure 17.3: Dusty plasma parameter space showing contours of constant w,, a, and 
Za/Annio Aa for an argon plasma with T/T; = 100. The experimental line corresponds 
to the density range of the Chu and I experiment (Chu and I 1994); these plots are from 
Bellan (2004). 


17.4 Dusty plasma parameter space 


A dusty plasma is characterized by its normalized dust radius 7g and normalized dust in- 
terparticle spacing a. These quantities determine P and, given P, the normalized dust 
surface potential %4 is obtained by solving Eq.(17.27). Knowing P and 7, gives a using 
Eq.(17.21). The consequence of this logical chain of argument is that œ can be considered 
to be a function of r4 and a. 

A dusty plasma parameter space can thus be constructed where a is the horizontal com- 


17.5 Large P limit: dust acoustic waves 491 


ponent and 7, is the vertical component so that a given dusty plasma would correspond to 
a point in this parameter space. The quantities Y4, a, and Z4/ Anni AS; are all functions 
of a and 7, and so contours of these three quantities can be drawn in the @,74 parameter 
space for specified m./m, and Te /T;. Examples of these contours are shown in Fig.17.3 
for an argon plasma with Te = 1007;. An actual experiment would have specific values of 
a and rq and so would be represented as a point in this parameter space. Variation of the 
ion density in the experiment would change the value of Aq; while keeping the ratio r4/a 
fixed and so would correspond to moving along a sloped line in parameter space. The short 
sloped line in Fig.17.3 represents the density in the dusty plasma experiment by Chu and I 
(1994) to be discussed later (the finite length of this line corresponds to the error bars for 
the density measurement). 


17.5 Large P limit: dust acoustic waves 


The large P limit (i.e., regime 1 discussed above) has nearly all the electrons attached 
to the dust grains so that the plasma effectively consists of negatively charged dust grains 
and positive ions. A wave similar to the conventional ion acoustic wave can propagate in 
this regime, but the role played by positive and negative particles is reversed: here the ions 
are the light species and the dust grains are the heavy species. In order to appreciate the 
consequence of this role reversal, consider the conventional ion acoustic wave from the 
most simplistic point of view. The wave phase velocity w/k is assumed to be much faster 
than the ion thermal velocity vr; (i.e., cold ion regime) but much slower than the electron 
thermal velocity vre (i.e., isothermal electron regime) so the respective approximations of 
the electron and ion equations of motion are 


0 = —n,.eE—V (n-KT.) (17.28) 
du; 
nimi = n;ZeE. (17.29) 
Adding the electron and ion equations and assuming quasineutrality ne ~ n; Z, gives 
du; 
nimi = —V (nekT.) (17.30) 


showing that the effective force acting on the ions is the electron pressure gradient. This 
force is coupled to the ions via the electric field. In effect, the electron pressure gradient 
pushes against the electric field which in turn pushes the ions. Linearizing Eq.(17.30) gives 
a result similar to a conventional sound wave, except that the system is isothermal with 
respect to the electron temperature (in a normal neutral gas sound wave, the gas temperature 
would appear in the right hand side, the gas would be adiabatic, and so a y would appear 
upon linearizing the gas pressure). Linearization of Eq.(17.30) and invoking the linearized 
quasineutrality relation Nne © nj;1Z, gives 


ðu; 
niom; = = —ZKT.Yna. (17.31) 
The linearized ion equation of continuity is 
ani: 
Mii he i 03 (17.32) 


ot 
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The equations are combined by taking the divergence of Eq.(17.31) and then substituting 
Eq.(17.32) to obtain 


= ens (17.33) 


7 
which is a wave equation with wave velocity c? = ZKT./m,. 

This method can now be generalized to a plasma consisting of ions with charge +e, 
free electrons with charge —e, and dust grains with charge — Ze. The free electron density 
is assumed to be small so that P >> 1 in which case the configuration is on the left of 
Fig.17.2. The wave phase velocity is assumed to be much faster than the dust grain thermal 
velocity, but much slower than the ion/electron thermal velocities so the respective electron, 
ion and dust grain equations of motion are 


0 = —n.eE—V (neKTe) (17.34) 
0 = +n,eE—V (nikT;) (17.35) 

d 
nama" = —ngZaeB. (17.36) 


Adding the above three equations and invoking quasineutrality (Le., ni = NaZa + Ne) 
results in 


d 
mamas = —V (njkT; + nekT.). (17.37) 
In analogy to the conventional ion acoustic wave, here the ion and electron pressures couple 
to the dust via the electric field. Linearization of Eq.(17.37) gives 


O 
Nana = = -KT;Vna — KTV (17.38) 


while linearization of Eqs.(17.34) and (17.35) gives 


0 = —n.peEy—KTeV ney (17.39) 
0 = +n peEi—KT;Vni;. (17.40) 


Eliminating E; between these last two equations shows that 


KTV ne = -22 KL Vni (17.41) 
Nio 
which can be integrated to give 
neo (17.42) 
Nel = — Nir- ; 
` nioTe “7 
Inserting Eq.(17.42) into the linearized quasineutrality expression ni1 = Nai Za + Nei 
gives 
1 
Nii = = nat ZaNar (17.43) 
niole 
and hence 
neo l;/Niole 
Nel = — — op lna. (17.44) 
e0ti 
1+ —— 


niole 
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Substitution for n;ı and n.1 in Eq.(17.38) gives 


Oud 1 — neo/nNi 
NoMa Sai KT; neo/n 0 Zana 

ot 1+ neoTi 

niole 
~ -KT;Z4qVna (17.45) 

since neo << nio and typically T;/Te << nio/neo. 
The linearized dust continuity equation is 
ð 

a nov ig =O (17.46) 


so taking the divergence of Eq.(17.45) and substituting Eq.(17.46) gives 


2na LakT; 2 
= — Vna 17.47 
PIE ae ( ) 


which is a wave with phase velocity 


Zo (17.48) 
Maq 
This wave is called the dust acoustic wave and its phase velocity is extremely low because 
of the large dust grain mass. 

This analysis could be extended to include finite kàp terms as obtained by using the 
full Poisson’s equation instead of making the simplifying assumption of perfect neutrality. 
A Vlasov approach could also be invoked to demonstrate the effect of Landau damping. 
Rather than work through the details of these extensions, one can argue that the a ~ 1 dusty 
plasma is effectively a two-component plasma where the heavy particles are the negatively 
charged dust grains and the light particles are the positively charged ions. The previously 
derived results from both two-fluid theory and Vlasov theory could then be invoked by 
simply identifying the heavy and light particles in the manner stated above. Thus by making 
the identification, 


ion acoustic wave | dust acoustic wave 


inertia provided by ions dust grains 
restoring force provided by | electron pressure ion pressure 


taking into account finite en, terms will result in a dispersive dust acoustic wave 


w2 = k ca, , 
1+ k?Ap; 
Similarly, just as an electron flow velocity faster than the ion acoustic phase velocity would 
destabilize ion acoustic waves via inverse Landau damping, a Landau analysis would show 
that an ion flow velocity faster than the dust acoustic phase velocity would destabilize dust 
acoustic waves. 
Destabilized dust acoustic waves have been observed in an experiment by Barkan, Mer- 
lino and D’ Angelo (1995). The phase velocity of these waves was of the order of 10 cm/s 
which is an order of magnitude less than a human walking speed. 


(17.49) 
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17.6 Dust ion acoustic waves 


The presence of dust grains can also substantially modify the propagation properties of 
conventional ion acoustic waves. In a conventional electron-ion plasma, ion acoustic waves 
can propagate only if Te >> T;. This is because strong ion Landau damping occurs when 
the wave phase velocity is of the order of the ion thermal velocity and this is impossible 
to avoid unless Te >> T; since the conventional ion acoustic phase velocity scales as 
2 = (YKT; + KT.) / m3. 

As in the conventional ion acoustic wave, the derivation of dust ion acoustic waves 
involves the linearized electron and ion equations 


0 = —neoeE1—KTeVNnei (17.50) 

om 2ni = nioeE1—yKT;Vni (17.51) 

but here the possibility of finite T; has been retained in order to allow consideration of 

the Te ~ T; regime. The wave frequency is assumed to be sufficiently high so that the 

dust grains are unable to respond to the wave; the dust grains are effectively considered 

as being infinitely massive and therefore stationary. In this limit the dust grains contribute 

to the equilibrium quasineutrality condition njo = Neo + Zanao but since the dust grains 

are assumed stationary their density cannot change and so the linearized quasi-neutrality 

condition is n;; = Ne. Thus, infinitely massive dust grains affect the equilibrium electron 
density, but not the perturbed electron density. 

Eliminating E; between Eqs.(17.50) and (17.51) gives 


Ou; Ni 
nomi = aa KT2V Ne —yKT; Vni. (17.52) 
Since nj = Nei and Nio/Neo = Nio/(Nio — Zanao), Eq.(17.52) becomes 
Ou; 1 
iMi k= Te N: p TT; V i- 17.53 
nym Dt EET K Nni—yK n ( ) 
Taking the divergence and using Eqs. (17.11) and (17.32) gives 
ani 1 
mi Ar = G rT wni) Vna (17.54) 


which gives the dust ion acoustic wave (Shukla and Silin 1992), a wave with phase velocity 


1 RKT KT; 
bra =J H=. (17.55) 


— Q n; Mi 


As qa approaches unity, the dust ion acoustic wave phase velocity becomes much greater 
than the ion thermal velocity even in a plasma having T; = T.. The dust ion acoustic wave 
thus propagates without being attenuated by ion Landau damping in a T; = T, plasma 
having a ~ 1, which corresponds to the left side of Fig.17.2. Thus, the presence of a large 
dust grain density enables the propagation of ion acoustic waves that normally would be 
damped in a Te = T; plasma; this has been observed in experiments by Barkan, D’ Angelo 
and Merlino (1996). 


17.7 The strongly coupled regime: crystallization of a dusty plasma 495 


17.7 The strongly coupled regime: crystallization of a dusty 
plasma 


The mutual repulsive force between two negatively charged dust grains scales as 77 and 
so will be very large for highly charged dust grains. In the extreme limit of this situa- 
tion the electrostatic potential energy between dust grains might exceed their kinetic en- 
ergy so that the grains would tend to form an ordered, crystallized state. The possibil- 
ity that dust grains might crystallize was first suggested by Ikezi (1986) and has since 
been demonstrated in a number of experiments (Chu and I 1994, Melzer, Trottenberg and 
Piel 1994, Thomas, MorfillL, Demmel, Goree, Feuerbacher and Mohlmann 1994, Hayashi 
and Tachibana 1994, Nefedov, Morfill, Fortov, Thomas, Rothermel, Hagl, Ivlev, Zuzic, 
Klumov, Lipaev, Molotkov, Petrov, Gidzenko, Krikalev, Shepherd, Ivanov, Roth, Binnen- 
bruck, Goree and Semenov 2003, Morfill, Annaratone, Bryant, Ivlev, Thomas, Zuzic and 
Fortov 2002). The threshold criterion for crystallization will be discussed in this section 
following a model by Bellan (2004b). The threshold is determined by considering certain 
issues relating to the validity of the conventional Debye shielding model and the Boltzmann 
relation. 

Large Za corresponds to large 7, which in turn corresponds to operating towards the 
right of Fig. 17.2. Since the location of the saturation value of Y4 increases with Te /T;, 
very large Y4 can occur if Te >> T;. The repulsive force also scales inversely with the 
square of the distance separating the two dust grains, i.e., the repulsive force also scales 
as nels Since P is proportional to ngo, the maximum repulsive force would be obtained 
around the knee in the Te >> T; curves in Fig. 17.2 since at this location it is possible to 
have large 7, without ngo becoming infinitesimal. 

The repulsive electrostatic force between two dust grains is attenuated by Debye shield- 
ing. This shielding can be calculated by considering a single dust grain to be a test particle 
immersed in a plasma consisting of electrons, ions, and other dust grains which will be re- 
ferred to as “field” dust grains. The test particle will be completely shielded beyond some 
critical radius. A field dust grain located beyond this critical radius will experience no in- 
teraction with the test particle dust grain whereas if the field dust grain is located within the 
shielding radius, it will experience an enormous repulsive force. The test particle dust grain 
thus acts like a finite-radius hard sphere in its interactions with field particle dust grains. 

A quantitative model for these interactions between dust grains can be developed by 
considering Poisson’s equation for a dusty plasma, 


V? = = (nie — nee — Zanae) . (17.56) 


The usual test-particle argument (see p.8) involves linearization of the Boltzmann relation 
for each species to obtain a linearized density for each species. These linearized densities 
are then substituted into Poisson’s equation resulting in the Yukawa-type solution, 


dt 
Atteor 


=- exp(—r/Ap) (17.57) 


where Ap” = X Ap-. However, this linearization is based on the assumption |q;¢/«T>| 
<< 1 which is clearly not true in the vicinity of a highly charged dust grain. This inconsis- 
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tency cannot be resolved in fluid theory and it is necessary to revert to the more fundamental 
Vlasov description. 

Particles are characterized by a velocity distribution function f(r, v) in the Vlasov de- 
scription and the Vlasov equation describes the time evolution of f. The test particle loca- 
tion is defined to be at the origin of a spherical coordinate system and the system is assumed 
to be spherically symmetric so that the only spatial dependence of ¢ and f is on the spheri- 
cal radius r. The nominal location of the last collision made by a particle incident upon the 
test particle is the collision mean free path Am 7p) and since any particle incident upon the 
test particle will have traveled many mean free paths, the velocity distribution of particles 
entering a sphere of order l,, 7 will be Maxwellian. Because the incident particles are col- 
lisionless within the Am fp sphere, their velocity distribution function must be a solution to 
the collisionless Vlasov equation inside the sphere. The boundary condition this solution 
must satisfy is that its large r limit should correspond to the collisional (i.e., Maxwellian) 
distribution. 

Solutions to the collisionless Vlasov equation are functions of constants of the motion 
as was shown in Sec.2.2 and the appropriate constant of the motion here is the particle 
energy W = m,v2/2+¢,¢(r). Hence the distribution function in the collisionless region 
near the test particle must be 


3/2 2 
fn (V) = neo ( m ) exo ( ist Zoet) (17.58) 


2rKTo KT, 


since this maps to a Maxwellian distribution at large distances r where ¢(r) — 0. 

A negatively charged particle such as a dust grain or an electron experiences a repulsive 
force upon approaching the dust grain test particle and so slows down. Some approaching 
negatively charged particles reflect and so the minimum velocity of electrons or dust grains 
approaching the dust grain test particle is zero. The density of these particles will thus be 


i. fa (r, v) dĉv 
0 


= noo exp (-" | (17.59) 

KT, 
which is the same as the fluid theory Boltzmann relation. It is useful at this point to change 
over to the non-dimensional scalar 7 defined in Eq.(17.15). Since the dust grains are neg- 
atively charged, w is large and positive in the vicinity of a dust grain. The respective 


normalized electron and field dust grain densities are thus 


No 


Ne 


= exp(—4T;/Te) (17.60) 
Neo 
“t= exp(-24) (17.61) 
Ndo 
where 7 
Z = ZaT;/Ta. (17.62) 


Both the electron and field dust grain densities decrease in the vicinity of the dust grain test 
particle. However, because Z is typically very large and T; /Te is assumed to be very small, 
the field dust density scale length is much shorter than the electron density scale length. 
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Ion dynamics are qualitatively different because all ions approaching the dust grain are 
accelerated, leading to the situation that no zero velocity ions exist near the dust grain. In 
particular, an ion starting with infinitesimal inward velocity at infinity where 7 = 0 has 
nearly zero energy, i.e., W ~œ 0. The energy conservation equation for this slowest ion is 


mv? /2 + e¢(r) =0 (17.63) 


and so the velocity for this slowest ion has the spatial dependence 


Umin = ay (17.64) 


Mi 
The ion density is thus 


a fi (r, v) dv 


Umin 


, 3/2 s oœ 2 
» (ain) e) e)a 
no- exp (Y) P exp (—€°) £as. (17.65) 


This can be expressed in terms of the Error Function 


ni(r) 


erfz= = Hk exp(—£7)dé (17.66) 


and, in particular, using the identity 


= i apie) Gag = = | dé exp(—£?) = l d (Gexr(-€) 
(17.67) 


it is seen that (Laframboise and Parker 1973) 


2 Sei 4) (1- erf vv) + avi. (17.68) 


Nio 


The error function has the small-argument limit 


lim erf z ~ — (17.69) 


= 


and so for Y << 1 
Ni 


=1+% (17.70) 


i0 
which is identical to the Boltzmann result given by fluid theory. 
However, because i 
lime * (1-— erf z) = 0, (17.71) 


when % >> 1 the ion density has a non-Boltzmann dependence 


= avi (17.72) 


Nio 
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which is much smaller than the exp(~) dependence predicted by the Boltzmann relation. 
Poisson’s equation can be written non-dimensionally as 


10 (9Y) mM ne Zang 
r2 or or B nio nio nio 
& e(a a (17.73) 
nio Neo Ndo 


where, following the normalization convention introduced earlier, 7 = r /Aqi. Upon substi- 
tuting for the normalized densities, Poisson’s equation becomes 


12 (aM) =o (1-et(Va)) 


-_o~ 


vacuum 


(17.74) 


Because w becomes large near the dust grain, this equation is highly non-linear and so 
the linearization technique used on p. 8 for the conventional Debye shielding derivation 
cannot be invoked. Instead, an approximate solution to Poisson’s equation is obtained by 
separating the collisionless region around the dust grain test particle into three concentric 


layers (regions) according to the magnitude of w : 


Region 1 Region 2 Region 3 
Definition pa>y>l 1>4>1/Z 1/Z >% 
Location adjacent to dust grain | middle spherical layer outer layer 
Ions non-Boltzmann, fast Boltzmann Boltzmann 
Electrons Boltzmann Boltzmann Boltzmann 
Dust zero density zero density Boltzmann 
Dependence | vacuum-like w/const. | growing/decaying Yukawa | decaying Yukawa 


Poisson’s equation has the following approximations in the three regions 


Region 1 


Region 2 


Region 3 


NOE (ck 

ror\ arj 

ESA pL =a Qa 
mor \" oF) 

10 (40 2 
ae x) Shean] 


The reasons for these approximations are as follows: 


(17.75) 


(17.76) 


(17.77) 
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In region 1, 7 >> 1 so that the ion density is given in principle by Eq.(17.72) which 
would give a right hand side scaling as \/7. This ion term would dominate the electron and 
dust terms since the latter are always less than unity in this region. Thus, keeping just the 
ion term on the right hand side, Poisson’s equation in region | reduces to 


ee E (17.78) 
72 or \" o)” Tp f 
A A 


however, the ion term above can be neglected compared to the vacuum term because 
Vw >> 1 since y >> 1. Thus Poisson’s equation can be approximated in region 1 by 
just the vacuum term. This assumption is quite good near the dust grain surface where W is 
indeed very large compared to unity, but becomes marginal when w approaches unity at the 
outer limit of region 1. 

In region 2, the ions have a Boltzmann distribution so n; /neo = 1+4. Since T;/T. << 
1, the normalized electron density is ne/neo = exp(—WT;/T-) ~ 1. Because Zy >> 1, 
the dust density is nearly zero in this region. 

In region 3, Eq.(17.74) approximates to 


1 oy DT; = 
Sere), 2) Pe ae) (t= (1- ZW) a 
r2 OF Or BE Te aap ae 
———— —— mamm 


ions 
vacuum electrons 


~ (L+aZ)y (17.79) 


where 7; /T, << 1 has been used. For finite a, the dust term dominates because aZ >> 1. 

The system has two boundary conditions, one at the dust grain surface and the other at 
infinity. The former occurs in region 1 and is set by the radial electric field on the dust grain 
surface. This boundary condition is obtained from Gauss’ law which relates 0w/OF at the 
dust grain surface to the dust grain charge and radius. The boundary condition at infinity 
can be considered as the large 7 limit for region 3 and requires ~ to vanish as 7 goes to 
infinity. 

If the same form of Poisson’s equation were to characterize regions 1, 2, and 3, then 
the system would be governed by a single second-order ordinary differential equation and 
the two boundary conditions described in the previous paragraph would suffice to give 
a unique solution. However, the equations in regions 1, 2, and 3 differ from each other 
and so must be solved separately with appropriate matching at the two interfaces between 
these three regions. The criteria for matching at the interfaces, determined by integrating 
Poisson’s equation twice across each interface, are that both w and its radial derivative must 
be continuous at each interface. An important feature of this analysis is that the locations 
of the two interfaces are unknowns to be determined by solving the matching problem. 
Since regions | and 2 are of finite extent, both decaying and non-decaying 7 solutions are 
allowed in these regions. However, only a decaying solution is allowed in region 3. 

Region 1, which is governed by Eq.(17.75), has vacuum-like solutions of the form 
w ~ cr! + d where c and d are constants. The constant c is chosen to give the correct 
radial electric field at the surface of the dust grain test charge and d is chosen so that y = 1 
at r; which is the as-yet undetermined location of the interface between regions 1 and 2. 
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Using Gauss’s law to prescribe the radial electric field at the dust grain surface and also 
choosing d so that Y = 1 atr = F, gives 


aa? (1 1 +) E 
} = r 
A ae (17.80) 


In region 2, which is governed by Eq.(17.76), the effective dependent variable is Y + a 
which has growing and decaying Yukawa-like solutions ~ exp(+7)/7. It is convenient to 
write the region 2 solution as 


1 
ro (3 +a) cosh (P= ro) + Bsinh (=Po) = ar 


Ya(F) = - (17.81) 


where the coefficients have been chosen so that y = 1/Z at Fo, the as-yet undetermined 
location of the interface between regions 2 and 3. The coefficient B is also undetermined 
for now, and it should be noted that % is independent of B atr = 75. 

Region 3, governed by Eq.(17.77), has a solution which can be expressed as 


(7) = = exp ( VaZ +1 (7 7o)) (17.82) 
r 
where the coefficients have been chosen to give y = 1/Z at Fo, the interface between 
regions 2 and 3. 
The condition that 07/07 is continuous at 7, gives 


B=a-2vaZ +1 (17.83) 


and this result completes the matching process at 7, since both w and 0w/OF have now 
been arranged to be continuous at 7,. Since %4 = 1 has already been arranged at 7;, in 
order to have continuity of ~ at r; it is necessary to set Ys = 1 at 7;, i.e., 


1 
Ti =o € + a) cosh (7; — To) + B sinh (7; — To) — ari. (17.84) 


All that remains to be done is to arrange for continuity of Ow /Or at r = 7;. Since Y 
has already been arranged to be continuous at T = 7;, continuity of ô (rw) /Or at T = 7; 
implies continuity of 0W/dr. This means that continuity of OW/Or at r = T; is attained 
by equating the derivatives of the numerators of the right hand sides of Eqs.(17.80) and 
(17.81) at 7;,ie., continuity of Ow /Or at r = 7; is attained by setting 


3 1 
l->- =o (z + a) sinh (7; — ro) + B cosh (Ti — To) —a. (17.85) 


Since Z >> 1 and a is of order unity, Eqs.(17.84) and (17.85) can be approximated as 


(L+a)ri = af, cosh (r; — To) + asinh (7; — To) (17.86) 


1 a3 
Lee 


3 ar, sinh (r; — ro) + a cosh (7; — Fo). (17.87) 
Ti 
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These constitute two coupled nonlinear equations in the unknowns 7; and 7, and can 
be solved numerically. Since the parameters a and @ in these equations are functions of 
position in dusty plasma parameter space, Eqs.(17.86) and (17.87) can be solved for 7; 
and 7, at any point in dusty plasma parameter space. Because a different solution set 
{7,70} exists at each point in dusty plasma parameter space, 7; and 7, may be considered 
as functions of position in dusty plasma parameter space, i.e., 7; = 7;(@,74) andr, = 
To (a, Ya ) 

Now consider the density of the field dust grains in the vicinity of r = fo. For r > To 
the normalized potential is prescribed by Eq.(17.82) and decays at the dust Debye length 
which is an extremely short length because Z >> 1. This precipitous spatial attenuation 
of w for 7 > To means that dust grains are completely shielded from each other when their 
separation distance slightly exceeds 7,. If this is the case, then the dust grains will not 
interact with each other; i.e., they can be considered as a gas of non-interacting particles. 
On the other hand, if 7 < ře, the normalized potential 7 then exceeds 1/7 and the dust 
grains will experience a strong mutual repulsion. This abrupt change-over is implicit in 
Eq.(17.61) which indicates that na/nao is near unity when wy << 1/Z (1.e., just outside 
r = fF) but near zero when w becomes significantly larger than 1/Z (i.e., just inside 
T = fo). Hence, a dust grain test particle can be considered as a hard sphere of radius 7, 
when interacting with field dust grains. 

Since the nominal separation between dust grains is a, if @ > 7, then each dust grain is 
completely shielded from its neighbors in which case the dust grains behave as an ideal gas 
of non-interacting particles. However, if @ < 7, is attempted, each dust grain experiences 
the full, unshielded repulsive force of its neighbors. This extreme repulsive force means 
that it is not possible for a to become less than 7,. In effect, each dust grain sees its neighbor 
as a hard sphere of radius 79. Thus, when a = 7, the collection of pressed-together dust 
grains should form a regularly spaced lattice structure with lattice spacing of order a = fo. 
The dusty plasma has thus crystallized. 


The condition for crystallization of a dusty plasma is thus 


ā < F(a, Fa) (17.88) 


which defines a curve in dusty plasma parameter space. Figure 17.4 shows a plot of con- 
tours of 7, /a in dusty plasma parameter space; for reference the region in parameter space 
associated with the Chu and I dusty plasma crystallization experiment is indicated as a bold 
line (as mentioned before, the finite length of this line corresponds to the density measure- 
ment error bars). The contour 7,(@,74)/@ = 1 gives the crystallization condition; above 
this contour the dust grains are crystallized. It is also necessary to have rg small as was 
specified in Eq.(17.13) so that 7, is not attenuated from its vacuum value by the effect of 
the shielding cloud. 


Figure 17.5 shows plots of logig Y, Y, 1044, ne/neo, Ni/Nio, and na/nao as functions 
of r for a dusty plasma on the verge of condensation. The functional form of % is obtained 
from the 7; and 7, values determined by solving Eqs.(17.86) and (17.87). The extremely 
rapid cutoff of the potential at ro, the interface between regions 2 and 3 is evident; the 
characteristic scale of this cut-off is the dust Debye length. The normalized densities are 
plotted using Eqs.(17.60), (17.68), and (17.61). The field dust grain density vanishes for r 
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inside 7,, consistent with the hard-sphere behavior of the dust grain test particle. Although 
w is tiny at the interface between regions 2 and 3, it nevertheless affects the average dust 
grain density profile at this location because of the extremely large charge on the dust 


grains. 
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Figure 17.4: Contours of 7,/a@ as determined from solution to matching problem. Conden- 
sation (crystallization) is predicted to occur above the 7,/a@ = 1 line. Chu and I (1994) 
crystallization experiment, shown as bold line, is seen to lie in the condensation region. 
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Figure 17.5: Plots of logio Y, Y, 1044, ne/neo, ni/nio, and na/nao as functions of 7 for 
a dusty plasma on the verge of condensation. Dust grain radius is at left of region 1, 
parameters are from Bellan (2004D). 
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17.8 Assignments 


Assuming T = 3 eV, T; = 0.03 eV, and n; = 10° cm~ 3, estimate to within an order 
of magnitude how long it would take for 5 um dust grains with density ng = 10° 
cm~? to become fully charged. Hint- estimate the initial electron current impinging 
on a dust grain and compare this to the final charge on the dust grain. Use Fig. 17.3 
to estimate Z4. 


. Orbital motion limit and angular momentum: The (OML) approximation ignores the 


effective potential resulting from centrifugal force. The validity of this approxima- 
tion is examined here for the cases of an algebraically decaying central force and an 
exponentially decaying central force. 


(a) Show that when centrifugal force is taken into account, the radial equation of 
motion for a particle in a spherically symmetric electrostatic potential (r) is 


where the effective potential x is given by 


mb2v2 
2r2 


and m, b, and v are defined as in Sec. 17.2. 


x=qġ(r)+ 


(b 


T 


Show that if gd is negative, a local minimum of the effective potential exists at 
some radial position rı; compare this situation to planetary motion in the solar 
system. 


(c) Show that if qd ~ —r~—” there can also be a local maximum at a radius r where 
rg > rı providing n satisfies a certain condition. What is this condition for n? 
Plot (7) for the situation where there is a local maximum. Can a particle inci- 
dent from infinity reach r; if its energy is less than x (72). What sort of condition 
does this place m,v, and b. Under what circumstances is the OML approxima- 
tion valid? Plot trajectories for the situations where the OML approximation is 
valid and situations where it fails. Comment on whether OML is a reasonable 
approximation for dust charging. 


(d) Repeat (c) above for the situation where gd ~ — exp(—r/A). 


Assume the OML approximation is valid so that the effective potential can be consid- 
ered to have a minimum in the vicinity of a negatively charged dust particle. Show 
that, if an incident ion does not hit the dust grain, it will reflect back to infinity. Show 
that a collision could cause an incident ion to be trapped in the effective potential well 
at rı as discussed in problem 1b. Once an ion becomes trapped how long would it stay 
trapped? 


. Dust Alfvén waves: Show that for waves with frequency below the dust grain cy- 


clotron frequency, a dusty plasma will support MHD Alfvén waves with dispersion 
w? = k?v? where vå = B?/nama. 
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. Dust whistler waves: Show that a dusty plasma with a œ 1 will support whistler-like 
waves with dispersion w œ we; cos 0 where weg << wW << Wei- 


. Make a plot like Fig. 17.3 for a dusty plasma with Te = T; and discuss whether such 
a plasma could condense and form crystals. 


. Consider a methane plasma consisting of CH, ions and electrons. Assume that there 
exist some infinitesimal dust nuclei at t = 0 and that these become negatively charged 
as discussed in Sec. 17.2. 

Further assume that any methane ion hitting the dust grain sticks to the dust grain so 
that the mass of the dust grain increases with time. Plot the dust grain radius as a 
function of time. How long would it take for the dust grains to become sufficiently 
large to form a crystal (assume that ng = 10° cm~3, T./T; = 100, Aq; = 100 um)? 


. Consider a weakly ionized dusty plasma with 5 um diameter dust grains, 3 eV elec- 
trons and room temperature (0.025 eV) argon ions. The dusty plasma is located above 
a horizontal metal plate lying in the z = O plane. Because electrons move faster than 
ions and dust grains, the electron flux to the metal plate is initially much larger than 
the ion or dust grain flux. The faster rate of loss for electrons causes the dusty plasma 
to become positively charged so that an electric field develops which tends to retard 
the electrons and accelerate the ions towards the metal plate. Thus, the plasma poten- 
tial is positive with respect to the metal plate and if the plasma potential is defined to 
be zero, then the metal plate has a negative potential, say ġ The electron flux to 
the plate will thus be 


plate- 


Tre = NUTe exp(—debpiate/KTe) 
and, using a sheath analysis as in Section 2.7, the ion flux to the plate is 
TD; = nc, 


where cs = \/KT./m,is the ion acoustic velocity. In equilibrium, the electron and 
ion fluxes will balance so that 


m 
exp(—debpiate/KTe) = = 
i 
or T 
m: 
Pplate = = In a 


where the temperature is expressed in electron volts. The change in potential will 
occur over a distance of the order of the electron Debye length going from the plate to 
the bulk plasma so that 


T Mi 

=- —z/Xpe) ln — 

d(z) > exp(—z/Ape)ln S 

where z is the distance above the metal plate. By considering the combined effect 
of the vertical electric field associated with this potential and of gravity acting on a 
dust grain show that dust grains will tend to levitate above the metal plate. Plot the 
potential energy of dust grains in the combination of the electrostatic and gravitational 
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potential fields. Assume that Z4 = 104 and that the dust grains are made of a material 
which has a mass density of 1 gm/cc. 


. Suppose a negatively charged dust grain has a charge Z4 and a radius rg. Show that the 


potential on the surface of the dust grain is much less than Z4/4aeora if ra >> AD 
where Ap is the nominal shielding length for the shielding cloud around the dust grain. 
What implication does this have for dust charging theory and why do interesting dusty 
plasmas typically have rg << Ap? 
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Appendix A 
Intuitive method for vector calculus 
identities 


Instead of providing the traditional ‘back of the book’ list of vector calculus identities, an 
intuitive method? for deriving these identities will now be presented. 

The building blocks for this are the rules for the vector algebra triple product and for 
the dot-cross product. These two vector rules will first be reviewed and then the method 
for combining the vector rules with the rules of calculus will be presented. 

Vector algebra triple product 

There are two forms for the vector triple product, depending on the location of the 
parenthesis on the left hand side, namely: 


Ax! Bxc |= B A-C =C A-B (A.1) 
middle outer middle other two outer other two 
dotted together dotted together 
AxB |xC=B A-C = A B-C . (A2) 
SE ` A ` A Ae SA —— 
outer middle middle other two outer other two 
dotted together dotted together 


Both of these distinct forms can be remembered by the two-word mnemonic “middle- 
outer”. The words middle and outer are defined with reference to the left hand side of 
both equations; middle refers to the middle vector in the group of three and outer refers to 
the outer vector in the parentheses. The first term on the right hand side is in the vector di- 
rection of the “middle” vector with the other two vectors dotted together; the second term 
on the right hand side is in the direction of the “outer” vector with the other two vectors 
dotted together. 

Dot-Cross product 

Now consider the combination of dot and cross, A - B x C. Here the rules are that 
the dot and cross can be interchanged without changing the result and the order can be 
cyclically permuted without changing the result, but if the cyclic order is changed then the 


5 This method was explained to the author by the late C. Oberman. 
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sign changes. Thus 


A-BxC=AxB-C interchange dot and cross, (A.3) 
A-BxC=BxC-A permutation maintaining cyclic order, (A.4) 
A-BxC=-A-CxB permutation changing cyclic order. (A.5) 


Derivation of the vector calculus identities 

The basic idea is to replace one of the vectors by the V operator and then rearrange 
terms and if necessary add terms. The criterion for these maneuvers is that, just like getting 
the right piece placed in a jigsaw puzzle, here all applicable rules of vector algebra and of 
calculus must be simultaneously satisfied. The simple example 


V- (YQ) =YV: Q+QVý (A.6) 
illustrates this principle of satisfying the vector algebra and the calculus rules simultane- 
ously. Here the dot always goes between the V and the Q in order to satisfy the rules of 
vector algebra and the V operates once on Q and once on w in order to satisfy the product 
rule (ab)’ = ab’ + a'b of calculus. 

A less trivial example is V - (B x C). Here the V must operate on both B and C 
according to the product rule so, neglecting vector issues for now, the result must be of the 
form BVC + CVB. This basic product rule result is then adjusted to satisfy the vector 
dot-cross rules. In particular, the dots and crosses may be interchanged at will and the sign 
is plus if the cyclic order is VBC, BCV, or CVB and the sign is minus if the cyclic order 
is VCB, CBV, or BVC. Since the V is supposed to operate on only one vector at a time, 
we pick the arrangement where the V is in the middle and either B or C is to its right 
so that each of B or C has a turn at being acted on by the V operator. The arrangements 
of interest are thus C-V x B and —B-V x C where the minus sign has been inserted to 
account for the change in cyclic order. The result is 


V-(BxC)=CVxB-BVxC (A.7) 

which satisfies both the vector algebra dot-cross rule and the product rule of calculus. 
Next consider the triple product Ax (V x C). Here the V operates only on the C and 
the vector triple product generates two terms as indicated by Eq.(A.1). Expanding the triple 


product according to Eq.(A.1) while arranging an ordering of terms where the V operates 
only on C gives 


Ax! V xC J =[.V.C,]-A-|[A_V C. (A.8) 
Sea NA ~~ Me | “HY 
middle outer middle outer middle outer 
Thus, the first term on the right hand side has its vector direction determined by V with the 
other two terms dotted together, and the parenthesis indicates that the V operates only on 
C. The second term on the right hand side has its vector direction determined by C with 
the other two terms dotted together and again the V operates only on C. 
This can be rearranged as 


(VC)-A=Ax(VxC)+A-VC. (A.9) 
Interchanging A and C gives 
(VA)-C=Cx(VxA)+C-VA. (A.10) 


Adding these last two expressions gives 


(VC)-A+(VA)-C=Ax(VxC)+A-VC+Cx (Vx A)+C-VA. (A.11) 
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However, using the same arguments about maintaining the dot and satisfying the prod- 
uct rule shows that 


V(A-C) = V(C-A) 
(VA)-C+(VC)-A. (A.12) 


Here the V operates once on the A and once on the C according to the product rule and 
the dot is always between the A and the C. Combining Eqs.(A.11) and (A.12) gives the 
standard vector identity 


V(A-C)=Ax(VxC)+A-VO+Cx (Vx A)+C-VA. (A.13) 


Finally, the triple product with two V’s can be expressed as 


Vx{ V xA |= V V.A = y? A (A.14) 
“x n ~~ ~o SA “YEH 
middle outer middle other two other two outer 
dotted together dotted together 
or, without the labeling, as 
VxVxA=VV-A-V?A. (A.15) 


The relationships V - V x A = Oand V x Vw = Ocan be proved by direct evaluation 
using Cartesian coordinates. 


Summary of vector identities 


Ax(BxC) = B(A-C)—C(A-B) 
(AxB)xC = B(A-C)—-A(B-C) 
A-BxC = AxB-C, interchange dot and cross 
A-BxC = Bx C-A, cyclic permutation, cyclic order maintained 
A-BxC = —A-CxB, cyclic permutation, cyclic order changed 
V- (WA) = YV-A+AVY 
V-(AxB) = BVxA-AVxB 
(VB)-A = Ax(VxB)+A-VB 
V(A-B) = Ax(VxB)+A-VB+Bx(VxA)+B-VA 
VxVxA = V(V-A)-V2A 
V-VxA = 0 


VxVwy = 0 


Appendix B 
Vector calculus in orthogonal curvilinear 
coordinates 


Derivation of generalized relations 


Let £1, £2, £3 be a right-handed set of orthogonal coordinates and let s be the distance 
along a curve in three dimensional space. Let h; be the scale factor relating an increment 
in the i” coordinate to an increment in length in that direction so 


dl, = hidr; (B.1) 
an example is the ¢ direction of cylindrical coordinates where dl, = rdo. Since the co- 
ordinates are assumed to be orthogonal, the increment in distance along a curve in three 
dimensional space is 


(ds)? = h? (dx1)? +h? (dx)? + R? (dz). (B.2) 


The {x1, £2, £3 } coordinates and corresponding {h1,h2,h3} scale factors for Cartesian, 
cylindrical, and spherical coordinate systems are listed in Table B.1 below. 


coordinate system | distance along a curve {@1, £2, £3} | {h1, h2, ha} 
Cartesian (d£) + (dy) + (dz)? {x,y,z} {1,1,1} 
cylindrical (dr)” + (rdo) + (dz)? {r, 6, z} {1,r, 1} 
spherical (dr) + (rd0)” + (r sin Odd)" {r, 0, o} {1,r,rsin 6} 


Table B.1: Coordinate systems and their scale factors 


The differential of the scalar Y for a displacement by dl, of the coordinate xı is 


dw = diz, - Vw (B.3) 
so the component of the gradient operator in the direction of ĉı is 


. vy a 1 
Gy y= — = — — _,, 
A Y dl 1 hy Ox, 
The partial derivative notation is invoked in the right-most expression because the displace- 
ment is just in the direction of x1. 


(B.4) 


Generalized gradient operator 

Because the coordinates are independent, a displacement in the direction of one coor- 
dinate does not affect the dependence on the other coordinates and so the gradient operator 
is just the sum of its components in the three orthogonal directions, i.e., 


vat 8 a 8 | ts 8 


E hy Ox, i hə Ox2 hg z3 P 
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Since Va, = #1/h1, the unit vector in the x direction is 
È = hı Vzı. (B.6) 
Also, because the coordinates form a right-handed orthogonal system the unit vectors 


are related by 
ĉi x fo = 4&3, Xo x £3 = £1, ĉ3 x £1 = £9. (B.7) 


Generalized divergence and curl 
Let V be an arbitrary vector 


V=V1 41 + Vote + V323 (B.8) 
and consider the divergence of the first term, 
= as (Viho Vro X ha Vz3) 


Êo x £3 
= V(hgh3Vy) - 
(hoh3 V1) Raha 
ĉı 
= - V (hah V; 
ogg V ahai) 
1 8) 
= ——— (hoh3V\). B. 
hilt Ory ne 
Extending this to all three terms gives the general form for the divergence to be 
1 6) 8) ð 
V-V= — (hoh V; — (hih V; — (hihəV3) |) . B.10 
sahara (oy (oto¥a) + are (ahaa) + 2 (haa) (B.10) 
Now consider the curl of the first term of the arbitrary vector, namely 
Vx (Vie1) = VX (Viti V21) 
= V (Vi A) x Vzı 
1 
= >V (Vh) x ĉi (B.11) 
hy 
and so 
1 ` 1 5 1 5 
vxV =y (Viki) x @ + N (Vz2h2) x ĉ2 + aa (V3h3) x #3. (B.12) 
1 2 3 


The component in the direction of ĉı is 


%:-VXxXV 


1 
rad (Voha) x Xo : Ly +—V (V3h3) x x3 : £1 
2 


-i (Vaha) (B.13) 
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Thus the general form of the curl of the arbitrary vector is 


VxV = ( 


hy X41 het hgt3 
aa 0/Ox, O/Oxq 0/dz3). (B.14) 
1°23 | WV hV hV 


Generalized Laplacian of a scalar 


The Laplacian of a scalar is 


Vy=V. (B.15) 


Using Eq.(B.10) this becomes 


eee. Ə (hh Y) Ə (hah Y) 3 (hıh OY 
Om haha Ox, hy Ox, Ox2 ho Ox2 0x3 hg Ox3 : 
(B.16 


The Laplacian of a vector in general differs from the Laplacian of a scalar because there 
are derivatives of unit vectors. However the general formula is overly complex and so the 
Laplacian of a vector will only be calculated for cylindrical coordinates which are typically 
of most interest. 


ĉı OY #2 0~ | 23 OW 
hy Ox, he Ox2 hg 0x3 ` 


Application to Cartesian coordinates 
{z1 £2, £3} = {ayy z}; {h1, h2, h3} = {1,1,1} 


Vy = a5 105, as (B.17) 
ôV, OV, ƏV, 
PN s a A (B.18) 
WON a (Z Te) a- (= a5 
& _ e) F (B.19) 
Vy = 2 ' AE oe (B.20) 
VV = at ag - (B.21) 
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Application to cylindrical coordinates 


{£1,%2, £3} = {r,, z}; {hi, h2, h3} = {1,r, 1} 


Ov 93V | OY 
Or ` rôp |” Oz 


10 _1OVs | OV, 
~ Pp Or ve) r 06° Oz 
E 10V, OVs As OV, OV, p 
= r Ob Oz '\ az Or 


F (+. (rVg) ad Z 


_ 19 (ay) 10% ay 
or Or Or) ` r2 3g? 82 


1/a mY 


Laplacian of a vector in cylindrical coordinates 
Before proceeding, note that the cylindrical coordinate unit vectors can be expressed in 
terms Cartesian unit vectors as 


sO 


Thus 


and 


f = ĉcoso +s o 
Q = —ĉsin ġ +Å cos p 
O. + O- x 
ap TP T f 


521 


(B.22) 


(B.23) 


(B.24) 


(B.25) 


(B.26) 
(B.27) 


(B.28) 


(B.29) 


(B.30) 


(B.31) 


(B.32) 


These results can now be used to calculate V? (V.F) and V? (ved) which can then be 
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used to construct the full Laplacian. The first calculation gives 


V? (VF) = V-V (V°) 
= V-((VV,)F+V,VF) 


= #V°V,+(VV,)-VF+tV-: (V,VF) 
= PV?V, +2VV,-VFLV,V7F 
2d V, 0? 
= vy, +2. wy, 4 z TA 
26 ƏV, V, 
PE aw2 r ies 
= PN V; ae 72 Ob =, rd (B.33) 


while the second calculation gives 
V? (vs) = v-v (v2) 
= V. (Vvo) o+ VeV6) 
= V?Vy + 2VVy-Vb+VgV7o 
= 6V°Vs- pa ae (B.34) 


Since V? (V72) = 2V?V, it is seen that the Laplacian of a vector in cylindrical coordi- 


nates is 


2 ƏV, V, 
Vv = #(V*V,-55%-4 
( "72 Ap $) 
A 2 ƏV, V, 
4 2 Sire a 
O) (v Vg + 5 ad +) 
ee Vi, (B.35) 


QO Və ð À 
vyv = (vy +22 4y, (vir V. v2) 
( Or r Op z) Aat Kaa 
V2 
Sg eee ae Lee 
Or r Op Oz r 


(v „ Ve OMe pyle ) (B.36) 
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Application to spherical coordinates 


{z£1, £2, £3} = {r,0, o}; {hi, he, ha} = {1,r,r sin 0} 


E A 0 Ow $ Ow 
Or roô rsin@ ð 
VV = Gage (OV) tranga COM + aap ag V 
VxV = alno- e) 
—_ (Z Ž Worsind)) 0 
: (> (Vor) aa 
| j 2 
= : z (e) T (ino) | sa Beh 
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(B.37) 


(B.38) 


(B.39) 


(B.40) 


Appendix C 
Frequently used physical constants and 
formulae 


Physical constants (to 3 significant figures) 


symbol value unit 
electron mass Me 9.11 x 10-3! kg 
proton mass Mp 1.67 x 107?7 kg 
vacuum permeability [Uo 4r x 1077 N A~? 
vacuum permittivity £o 8.85 x 1071? Fmt 
speed of light c 3.00 x 108 ms ! 
electron charge € 1.60 x 10-19 C 
Avogadro’s number 6.02 x 1073 mol! 
Boltzmann constant =k 1.60 x 10719 Jey! 


Formulae 
(all quantities in SI units, temperatures in eV, A is ion atomic mass number) 
Lengths 


Debye length (p.7) 


1 1 
va 


where ø is over all species participating in shielding and 


Te To 
dy = ESTA E 
N00 4 no, 


Electron Larmor radius (p.234) 


The = m 
[weel 
Ion Larmor radius (p.234) 
VKT: /M; 4 VAT; 
PEEN Mra ig a 
Weil BVZ 


where A is atomic mass and Z is ion charge. 
Electron collisionless skin depth (p.145) 


c 5.3 x10° 
We 7 Vie 
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Ion collisionless skin depth (assuming quasi-neutral plasma so n;Z = ne 


© =2.3x 108,/ Ai 
Wpi Zne 


Frequencies 


Electron plasma frequency (p.126) 


Wpe 1 jn 
é E me =9y eH 
Ip 2T 27 \| EqMe nara 


Ion plasma frequency (p.126) 


f 1 pmi E ER 
pi = Up = gra com Foe 


Electron cyclotron frequency (p.79) 
1 eB 


ee S= a ce| = = 2. 8 10°B H 
f 2T [Mee| 27Me a a 
Ion cyclotron frequency oe 
ZeB ZB 
ci = = [wei] = —— = 1.55 x 10’ H 

f zl |= 27m; s A í 

Upper hybrid frequency E 
fun =A] en + E 


Lower hybrid frequency (p.186) 


fin = 


Diocotron frequency (pure electron plasma, p.462) 


faioc = fa /2 fee 
Velocities 
Electron thermal velocity (p.154) 
2KTo 
Ure = 4|- =5.9 x 105,/T. m s~! 
Me 


Ion thermal velocity (p.154) 


ma fE- pa 
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Ion acoustic velocity (p.(4.33)) 


‘Te Tas t 
cs = 4/2 = 9.8 x 10°4/-2ms7} 
Mm; A 


Electron diamagnetic drift velocity (p.250) 


KT 


1 — 
Ude = — =Vnims 4 
, eB |n n 


1 T: 
vnl BE 


Ion diamagnetic drift velocity (flow in opposite direction from electrons, (p.250) 


1 


KT; 
Ud,e 


e= ms 
qB 


1 T; |1 
—-Vn| = —Vn 
ZB |in 


n 


Alfvén velocity (p.132) 


B B Z : 
va = = = 2.2 x 10! B4 | — m s~! 
VEoniMi Hone Amp/Z neA 


Wave phase velocity (p.202) 


Uph = k — fA 
Dimensionless 
Plasma beta (p.319) 
ne RRL. 2 4.0267 x 10-25 nT 
B? B? 
Lundquist number (p.381) 
S= Hoovab 
n 
Collisions, resistivity, and runaways 
Electron-electron collision rate (p.21) 
2; -ynInA _, 
Vee =4x 10 3/2 


eV 


where In A ~ 10, electron-ion rate same magnitude, ion-ion slower by \/m./m; 


Spitzer resistivity (p.394) 


4ZmA 


n = 1.03 x 107 BP Ohm-m 
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Dreicer runaway electric field (p.395) 


InA 
Epreicer = 5.6 x 107 neZ T V/m. 
Typical neutral cross-section (p.16) 5 
Greu ~ 3 X 1077 m? 
Warm plasma waves 
Electrostatic susceptibility (p.165) 
Se eaa] 
Xe T ype 
where a = w/kvro 
Plasma dispersion function (p.165) 
a = —;, —— 
mE E-a) 
2a? 
Jim, Z(a) = —2a (1 S ; a] + int/? exp(—a”) 
j 1 fi 1 | 3 | | 31/2 2 
ue Ake) z a i 202 dat ` ~ P en) 
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Index 


action 
integral in Lagrangian formalism, 
63 
in wave-wave coupling, 433 
relation to quantum number, 433 
action integral, 67 
adiabatic invariance 
breakdown of, 95 
pendulum, 66 
adiabatic invariant 
general proof for Hamiltonian sys- 
tem, 70 
J, second adiabatic invariant, 88 
mu conservation, 81 
relation to quantum number, 68 
third adiabatic invariant, 89 
adiabatic limit of energy equation, 40 
Airy equation, 241 
Alfven time, 370 
Alfven velocity, 183 
Alfven wave, 131 
compressional (fast), 134, 136 
in parameter space, 196 
inertial Alfven wave, 187, 207 
shear (slow), 137 
two-fluid model, 138 
compressional mode, 146 
inertial Alfven wave, 144 
kinetic Alfven wave, 145 
alpha particle, 296 
Altar-Appleton-Hartree 
relation, 197 
ambipolar diffusion, 18 
analytic continuation, 160 
antenna, 193 
Appleton-Hartree dispersion relation, 
197 


dispersion 
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arc, 290 
astrophysical jet, 295 
axisymmetry, 280 


bad curvature, 305 
ballistic term, 415 
beam echo, 416 
beat waves, 398 
Bennett pinch, 275, 295 
Bernstein waves, 236 
Bessel function model, 358 
Bessel relationships, 233 
beta, 319 
bilinear function, 311 
Bohm-Gross wave, 129 
Boltzmann relation 
difficulties with, 498 
in Debye shielding, 7 
limitations of, 495 
bounded volume, 190 
bounding surface, 190 
break-even 
fusion, 297 
break-even, fusion, 296 
Brillouin backscatter, 438 
Brillouin flow, 460 
Brillouin limit, 462 
Bromwich contour, 156 


Calugareanu theorem, 356 

canonical angular momentum, 99 
in non-neutral plasma, 464 
toroidal confinement, 285 

canonical momentum, 259 
definition of, 64 

cathode emission, 27 

caviton, 214, 451 

center of mass frame dynamics, 382 


Child-Langmuir space charge limited 
emission, 29 
CMA diagram, 188 
coalescence 
mode, 240 
coalescence, mode, 212 
cold plasma wave energy equation, 
221 
collimation, 294 
collision 
frequencies, 14 
relations between cross-section, 
mean-free-path, 27 
collisions 
and quasi-linear diffusion, 426 
Coulomb, 11 
Fokker-Planck model, 389 
qualitative treatment in Vlasov 
equation, 34 
with neutrals, 16 
condensation 
of dusty plasma, 495 
confinement time, 296 
constant of the motion 
definition of, 64 
convective derivative 
definition of, 38 
coronal loop, 295 
correspondence of drift equations 
with MHD, 91 
coupled oscillator, 442 
cross-section 
effective, for dust grain charging, 
484 
fusion, 297 
neutral, 17 
small-angle scattering, 13 
crystallization, 495 


crystallization criterion, dusty 
plasma, 501 
current 


curvature, 93 

diamagnetic, 91, 250 

force-free in Grad-Shafranov 
equation, 281 


529 


grad B, 93 

parallel, 268 

polarization, 93 

poloidal, 280 

sheet, 362 

sheet, in Sweet-Parker reconnec- 

tion, 380 

toroidal, 280 
curvature 

good v. bad, 305, 318 

magnetic field, 271 
curvature current, 93 
curvature drift, 80 
curved magnetic field, 79 
cusp field, 103 
cutoff, 131, 185, 190 
cyclotron motion 

sense of rotation, 29 


daughter wave, 429 
Debye shielding, 129 
derivation of, 7 
solution of pde for, 24 
decay instability, 428, 438 
delta W 
in MHD energy principle, 313 
destructive interference, 202 
deuterium, 296 
diamagnetic, 282 
current, 91 
drift velocity, 250 
dielectric constant, 95 
dielectric tensor, 179 
dielectric tensor elements, 181 
right, left hand polarization, 183 
diffusion 
ambipolar, 18 
magnetic, 364 
quasi-linear velocity space, 405 
diocotron mode, 465 
resistive wall, 
method, 479 
distribution function 
definition of, 31 
for collisionless drift wave, 260 
moments of, 33 


image charge 


double adiabatic laws 
derivation of, 49 
Dreicer electric field, 395 
drift equations, 73 
curvature drift, 80 
derivation of, 75 
drift in arbitrary force field, 78 
grad B drift, 80 
grad B force, 78 
polarization drift, 78 
drift wave, 249 
collisional, 253 
collisionless, 258 
destabilization of, 257, 262 
features, 257 
nonlinear pumping of plasma, 257 
drifts 
curvature, 80 
Ex B, 73, 75 
force, 75 
generalization to arbitrary fre- 
quency, 110 
grad B, 80 
polarization, 78, 80 
dumbell wave normal surface, 191 
dust 
charge on grain, 486 
charging of grains, 485 
levitation, 505 
dust acoustic waves, 491 
dust charging, 504 
dust ion acoustic waves, 494 
dusty plasma, 483 
crystalization criterion, 501 
crystallization (condensation), 495 
parameter space, 490 
dynamic equilibria, 274, 286 


Ecross B drift, 73, 80 
echoes, 412 

higher order, 426 

spatial, 425 
effective potential, 99 
electric field 

runaway (Dreicer), 395 
electromagnetic plasma wave 
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in inhomogeneous plasma, 211 
electromagnetic wave 
derivation of, 130 
in decay instability, 442 
electron plasma wave, 129 
in decay instability, 440 
electrostatic ion cyclotron wave, 263 
ellipsoid wave normal surface, 191 
energy 
inductive, 273 
per helicity, 344 
wave, 219 
energy conservation 
in nonlinear waves, 399 
energy equation 
adiabatic limit, 40 
isothermal limit, 40 
wave, 219 
energy equation, two-fluid, 40 
energy principle, 306 
energy transfer between waves and 
particles, 114 
entropy 
collisions, 43 
conservation of, 413 
of a distribution function, 41 
equilibrium 
impossibility of spherically sym- 
metric MHD, 295 
stable v. unstable, 298 
extraordinary wave, 186 


fast mode, 134, 192 

Fermi acceleration, 88 

first adiabatic invariant, 82 

floating potential, 57 

flow (MHD accelerated), 286 
compressible, 291 
incompressible, 286 
stagnation of, 294 

flux 
accumulation of toroidal flux in 

flow, 294 

frozen-in, inductance, 272 
poloidal, 97 
private and public, 379 


relation to vector potential, 97 
volume per unit, 334 
flux linkage, 346 
flux preservation, 272 
flux surface, 96 
in mirror, 91 
flux tube 
Grad-Shafranov solution for, 295 
Fokker-Planck theory, 382 
force 
axial MHD force, 287 
between parallel currents, 268 
centrifugal, 79 
flux-conserving, 273 
hoop, 270 
non-conservative, 286 
pinch, 269 
force-free 
magnetic field, 344 
force-free equilibrium, 322, 335 
Bessel function model, 358 
Lundquist solution, 358 
force-free magnetic field, 268 
free-energy 
in sheared non-neutral plasma, 477 
frequently used formulae, 524 
frictional drag, 391 
qualitative derivation of, 37 
frozen-in flux 
proof of, 48 
fusion 
break-even, 297 
criteria for, 296 


Gaussian integrals, 59 
geometric optics, 213 
good curvature, 305 
grad B 
current, 93 
drift, 80 
force, 78 
Grad-Shafranov equation, 278 
Green’s function in diocotron mode, 
474 
group velocity, 201 
guiding center 


definition of, 76 


Hall term, 48 
Hamilton-Lagrange 
formalism, 62 
Hamiltonian, 64, 65 
coupled harmonic oscillator, 430 
geometric optics, 213 
Helmholtz equation, 358 
Hermitian part, 223 
hollow profile, 474 
hoop force, 270 


ignitron, 29 
ignorable coordinate, 64 
image charge, line, 479 
inductance, 272 
induction equation, 49 
in magnetic reconnection, 366 
inhomogeneous plasmas, 210 
initial condition 
Poisson’s equation as, 179 
initial value, 158 
instability 
caviton, 451 
decay, 428 
free-boundary, 323, 326 
kink, 325, 331 
parametric, 428 
positron-electron streaming, 150 
Rayleigh-Taylor, 299 
resistive wall, 471 
sausage, 323, 331 
streaming, 149 
ion acoustic velocity 
in sheath, 56 
ion acoustic wave, 129 
in decay instability, 439 
Landau damping of, 170 
Landau instability, 176 
ion saturation current, 56 
ion-ion resonance, 208 
isothermal limit of energy equation, 
40 


J, second adiabatic invariant, 87 


jet 
astrophysical, 295 
joining, in mode conversion, 249 


kappa 
magnetic curvature, 271 
Kelvin vorticity theorem, 463 
kink, 323 
kink instability, 325, 331 
diamagnetic or paramagnetic?, 335 
Korteweg-de Vries equation, 455 
Kruskal-Shafranov stability criterion, 
333 


Lagrange multipliers, 58 
Lagrange’s equation, 64 
Lagrangian 

electromagnetic, 64 
Landau damping, 169 

and entropy, 413 
Landau problem, 153 
Langmuir probe, 56 
Langmuir wave, 129 
Laplace transform, 155 

and ballistic term, 415 
Laplace’s equation, 265 
Larmor radius 

poloidal, 285 
laser fusion, 458 
Lawson criterion, 296 
leap-frog numerical integration, 25 
lightning bolt, 200 
line-averaged density, 131 
linear mode conversion, 241 
linearization, general method, 123 
loss-cone, 87 
lower hybrid frequency, 186 
Lundquist number, 381 
Lundquist solution, 358 


magnetic 
axis, 283 
diffusion, 364 
islands, 376 
reconnection, 360 
shear, 305 
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magnetic field 
force-free, 268 
minimum energy, 343 
most general axisymmetric, 279 
pressure, 268 
reconnection in sheared field, 371 
Solov’ev solution, 282 
tension, 268 
vacuum, 265 
magnetic field curvature, 271 
magnetic helicity, 320, 336, 345 
association with current-driven in- 
stability, 319 
conservation equation, 321, 357 
conservation of, 341 
global, 322 
linkage, 336 
linked ribbons, 358 
twist, 338 
magnetic mirror, 84 
magnetic moments 
density of, 92 
magnetic pressure, 268 
magnetic stress tensor, 271 
magnetic tension, 268 
magnetization, 92 
magnetofluid, definition of, 306 
magnetoplasmadynamic thruster, 290 
magnetron, 481 
Manley-Rowe relations, 430 
Maxwellian distribution 
definition of, 44 
MHD equations 
derivation of, 46 
Ohm’s law, derivation of, 47 
MHD with neutrals, 60 
minimum energy 
frozen-in flux, 272 
minimum energy magnetic field, 343 
moments of distribution function, 33 
momentum conservation 
in nonlinear waves, 399 
mu conservation, 81 
mu, definition of, 82 


National Ignition Facility, 459 


negative energy wave, 225 
neutron, 296 
nominal plasma parameters, 21 
non-adiabatic motion, 95 
non-linear dispersion relation, 444 
non-neutral plasma, 460 

relation to Landau damping, 476 
non-resonant particles 

in quasilinear velocity space diffu- 

sion, 411 
numerical integration of particle tra- 
jectory, 25 


orbital motion limited model, 485, 
504 

ordinary wave, 186 

orthogonal curvilinear coordinates, 
518 

overdense, 197 


parallel current, 268 
paramagnetic, 282 
parameter space, 195 
parametric decay instability, 428, 458 
Parker reconnection model, 378 
Penrose criterion, 172 
phase integral, 232 
phase mixing, 202 
and echoes, 416 
phase space, 30 
physical constants, 524 
pinch 
Bennett, 295 
pinch force, 269 
plasma dispersion function 
definition of, 165 
large argument limit of, 167 
small argument limit of, 167 
plasma frequency 
definition of, 126 
plateau velocity distribution, 412 
Plemelj formula, 171 
polarization current, 93 
relation to capacitance, 95 
polarization drift, 78, 80 
change in particle energy, 120 
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in Alfven wave, 132 
relation to MHD equation of mo- 
tion, 93 
stochastic motion, 106 
poles, 160 
poloidal 
definition of, 279 
poloidal flux, 279 
poloidal Larmor radius, 285 
poloidal magnetic field, 280 
ponderomotive force, 436, 448 
Poynting flux, 220, 309 
Poynting theorem, 357 
pressure 
magnetic, 268 
pressure tensor, definition of, 37 
principal resonances, 190 
private flux, 379 
product rule for oscillating physical 
quantities, 221 
public flux, 379 
pump 
MHD acceleration of fluid, 290 
pump depletion, 457 
pump wave, 429 
pure electron plasma, 460 


QL modes, 199 

QT modes, 199 

quantum mechanics, 323 

quantum mechanics, correspondence 
to, 214 

quasi-linear velocity space diffusion, 
399 

quasi-neutrality, 9 

quasilinear diffusion 

and collisions, 426 
quiver velocity, 436 


radiation pressure, 436 

radius of curvature, 79 

Raman backscatter, 438 

random velocity, 37 

reduced mass, 383 

redundancy of Poisson’s equation, 
178 


refractive index 
definition of, 181 
relaxation, 341 
relaxed state, 344 
residues, 160 
resistive mode, 360 
resistive time, 371 
resistive wall instability, 471, 479 
resistivity 
Fokker-Planck model, 393 
simple derivation of, 17 
resonance, 185, 190 
ion cyclotron, 193 
ion-ion, 208 
resonance cones, 204 
resonant particles 
in quasilinear velocity space diffu- 
sion, 407, 409 
reversal of magnetic field 
spatial (cusp), 103 
temporal, 102 
reversed field pinch, 285, 344 
ribbon, gift-wrapping, 358 
Richardson-Dushman temperature 
limited emission, 29 
Rosenbluth potentials, 388, 395 
runaway electric field, 395, 397 
Rutherford scattering 
derivation, 22 


saddle point, 243 
safety factor, 333 
tokamak, 375 
sausage instability, 323, 331 
scale factor, 518 
scattering 
energy transfer, 23 
large angle, 12 
Rutherford, 11, 22 
small angle, 13 
second adiabatic invariant, 87 
self-adjointness, 310 
self-confinement 
impossibility of, 276 
self-focusing, 438 
separatrix, 284 
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sheath physics, 53 
sheet current, 362 
short wave radio transmission, 228 
slow mode, 192 
slow wave, 133 
Snell’s law, 211 
solar corona loop, 295, 381 
soliton 
interaction, 456 
ion acoustic wave, 454 
propagating envelope, 453 
stationary envelope, 452 
Solov’ev solution, 282 
orbits in, 295 
space-charge-limited current, 27 
spheromak, 285, 344 
stagnation of flow, 294 
static equilibria, 274 
steepest-descent contour, 242 
stellarator, 285 
stimulated Raman scattering, 458 
Stirling’s formula, 58 
stochastic motion, 106 
streaming instability, 149 
stress tensor 
magnetic, 271 
strongly-coupled plasma, 495 
surface wave, 214 
susceptibility 
definition of, 126 
Sweet-Parker 
reconnection, 378 
symmetry 
assumption in MHD jet, 286 
canonical angular momentum & 
toroidal confinement, 285 
in Grad-Shafranov equation, 280 
in Lagrangian formalism, 64 
in solution of Grad-Shafranov 
equation, 280 


taxonomy of cold plasma waves, 188 
Taylor relaxation, 341 

TE mode, 216 

tearing, 364 

temperature limited current, 27 


tension 
magnetic, 268 
thermal force, 59 
third adiabatic invariant, 89 
thruster, 290 
TM mode, 216 
tokamak, 285, 333 
topology 
minimum energy, 273 
torque, 286 
trapped particle 
in mirror, 87 
tritium, 296 
twist and writhe, 351, 356 
two-fluid equations 
derivation of, 36 


underdense, 197 
unperturbed orbit, 232 
untrapped particle 

in mirror, 87 
upper hybrid frequency, 186 


vacuum magnetic field, 265 
vector calculus identities, 515 
vector potential, 320 

relation to flux, 97 
velocity 

Alfven, 132, 183 

bounce-averaged, 90 

definitions of ‘average’ velocity, 

110 

diamagnetic drift, 250 

group, 201 

in sawtooth wave, 111 

in small amplitude oscillatory 

field, 108 

quiver, 436 
virial theorem, 276 
viscosity, 274, 286 
Vlasov equation 

derivation of, 31 

moments of, 34 

treatment of collisions, 34 
volume per unit flux, 334 
vortices 
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magnetic reconnection, 364 
vorticity, 286 
source for, 289 


wall stabilization, 326 
warm plasma dispersion relation 
(electrostatic), 234 
water beading, 362 
wave 
Alfven, 131 
beating, 398 
Bernstein, 236 
definition of cold plasma wave, 
178 
drift, 249 
dust acoustic, 491 
dust Alfven, 504 
dust ion acoustic, 494 
dust whistler, 505 
electron-plasma, 129 
electrostatic ion cyclotron, 263 
energy equation, 219, 221 
energy, finite temperature wave, 
224 
extraordinary, 186 
frequency dispersion, 201 
ion acoustic, 129 
magnetized cold plasma disper- 
sion, 181 
negative energy, 225 
ordinary, 186 
quasi-electrostatic cold plasma, 
203 
surface, 214 
whistler, 200 
wave energy 
in diocotron mode, 468 
wave normal surface, 188, 191 
wave-wave nonlinearity, 428 
waveguide, plasma, 214 
wheel wave normal surface, 191 
whistler wave, 200, 207 
Wigner-Seitz radius, 487 
WKB approximation, 66 
WKB criterion, 212 
WKB mode 


correspondence to saddle point, work function, 29 


245 writhe and twist, 351, 356 
WKB solution, 211 
WKB solutions X-point, 379 
of mode conversion problem, 249 
Woltjer-Taylor relaxation, 341 Yukawa solution, 9, 24, 495, 498 
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